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ABSTRACT 


Title  of  Dissertation:  Asymptotic  Nonlinear  Filtering  and 

Large  Deviations  with  Application 
to  Observer  Design 

Matthew  Ronald  James,  Doctor  of  Philosophy,  1988 

Dissertation  directed  by:  J.  S.  Baras 

Professor,  Department  of  Electrical  Engineering, 
and  Director,  Systems  Research  Center 
L.  C.  Evans 

Professor,  Department  of  Mathematics 


An  important  problem  in  control  theory  is  the  design  of  observers  for  non¬ 
linear  control  systems.  By  observer  we  mean  a  deterministic  dynamical  system 
which  uses  observed  information  to  compute  an  estimate  of  the  state  of  the 
control  system  in  such  a  way  that  the  error  decays  to  zero.  Baras  and  Krish- 
naprasad  have  proposed  that  an  observer  design  might  result  from  a  study  of  an 
asymptotic  nonlinear  filtering  problem  obtained  by  adding  small  noise  terms  to 
the  equations  defining  the  control  system.  The  purpose  of  this  thesis  is  to  study 
this  asymptotic  filtering  problem  and  to  develop  observer  designs  based  on  their 
idea. 


Asymptotic  nonlinear  filtering  problems  have  been  studied  by  several  authors, 
and  are  closely  related  to  large  deviations  (Wentzell-Freidlin  theory).  We  prove 
using  vanishing  viscosity  and  control  theoretic  methods  a  logarithmic  limit  result 
for  solutions  of  the  Zakai  equation.  This  limit  is  characterised  by  a  Hamilton- 
Jacobi  equation  which,  as  noted  by  Hijab,  arises  in  Mortensen’s  deterministic 
minimum  energy  estimation.  We  make  a  careful  study  of  this  equation  in  the 
light  of  the  relatively  recent  theory  of  viscosity  solutions  due  to  Crandall  and 
Lions.  We  study  the  weak  limit  of  the  conditional  measures  and  filters.  Inspired 
by  Hijab’s  large  deviation  result  for  pathwise  conditional  measures,  we  obtain  a 
large  deviation  principle  “in  probability”  for  the  conditional  measures,  and  also 
a  large  deviation  principle  for  the  distributions  of  these  measures. 

This  asymptotic  analysis  suggests  that  the  limiting  filter  is  a  candidate  ob¬ 
server.  We  present  an  exact  infinite  dimensional  observer  for  uncontrolled  ob¬ 
servable  systems.  In  the  case  of  uncontrolled  nonlinear  dynamics  and  linear 
observations,  Bensoussan  obtained  a  finite  dimensional  observer  which  is  an  ap¬ 
proximation  to  the  limiting  filter.  A  detectability  condition  was  used  to  prove 
exponential  decay  of  the  error,  provided  the  initial  condition  lies  in  a  bounded 
region.  We  extend  his  approach  to  the  general  case  of  controlled  nonlinear  dy¬ 
namics  and  nonlinear  observation.  In  particular,  we  obtain  an  observer  for  a 
class  of  fully  nonlinear  systems  with  no  constraints  on  the  initial  conditions. 
The  Benes  case  is  considered. 
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Chapter  1 
Introduction 


An  important  problem  in  control  theory  is  the  construction  of  observers  for 
nonlinear  control  systems,  say  of  the  form 

x  =  f(x,u ),  x-(0)  =  :r0,  (1.1) 

y  =  Hx)i 

where  x  £  lRn  and  y  £  1R.  Here,  the  initial  condition  Xo  is  unknown  and  y 
represents  observations  or  partial  information  regarding  the  state  trajectory. 

The  observer  problem  consists  of  computing  an  estimate  x(t)  of  x(t),  for 
which  the  error  decays  to  zero  as  t  — >  oo;  that  is,  to  design  a  system 

m  ~  F(m,u,y),  m(0)  =  mo,  (1.2) 

x  =  G(m), 

where  m  £  M.  and  x  £  JRn ,  such  that  (for  example) 

|  x{t)  —  x{t)  |<  (7e_7<,  7  >  0.  (1.3) 

The  observer  state  space  M.  need  not  necessarily  be  finite  dimensional.  One 
seeks  methods  for  constructing  the  system  (1.2)  from  the  data  /,  h  given  for 
(1.1). 

When  the  control  system  (1.1)  is  linear  (/(x,u)  =  Ax  +  Bu,  h(x)  =  Cx ), 
the  observer  problem  was  solved  by  Luenberger  [44].  To  date  various  ad  hoc 
methods  have  been  applied  in  the  general  nonlinear  case  to  certain  special  cases 
(eg.  linearisation,  choosing  convenient  local  coordinates,  etc.),  and  no  general 
methodology  has  appeared. 


1 


Baras  and  Krishnaprasad  [1]  have  proposed  a  method  employing  nonlinear 
filtering  asymptotics  based  on  the  informal  idea  that  “nonlinear  observer  =  limit 
of  nonlinear  filters” .  This  is  a  probabilistic  approach  to  a  deterministic  problem, 
where  one  uses  additional  probabilistic  structure  in  an  effort  to  gain  insight  into 
how  to  solve  the  problem. 

Associate  with  (1.1)  the  family  of  filtering  problems 

dxe(t )  =  f(xe(t),u(t))dt  +  ^Ndw(t),  a;£(0)  =  xc0,  (1.4) 

dyc's(t)  =  h{x\t))dt  +  V6Rdv(t ),  y£,5(0)  =  0, 

where  w,  v  are  independent  standard  Wiener  processes,  Xq  is  a  random  variable 
with  distribution  /j,q  and  density  Ce  exp(  —  Sq(x)/c).  Since  Xq  is  unknown,  So  is 
chosen  so  that  //„  ^m0  as  e  — >  0  for  some  m0  G  Mn.  The  matrices  N,  R  are 

inserted  to  give  extra  flexibility  in  the  design  (these  are  called  design  parameters); 
we  assume  that  rankAf  =  n  and  R  >  0.  Then  as  e,  8  — >  0,  the  trajectories  of 
(1.4)  converge  in  probability  to  the  trajectories  of  (1.1)  corresponding  to  the 
initial  condition  x(0)  =  m0. 

To  obtain  a  nonlinear  observer  for  (1.1),  one  then  studies  the  limit  of  the 
nonlinear  filters  for  (1.4)  and  identifies  the  limiting  filter  (if  indeed  one  exists). 
Having  accomplished  this,  one  then  must  try  to  compute  the  error  between  the 
estimate  given  by  the  limiting  filter  and  the  actual  deterministic  trajectory  as 
t  — >  oo.  Thus  this  approach  involves  two  kinds  of  asymptotics:  small  parameter 
and  large  time.  The  latter  is  very  difficult  in  general,  and  is  related  to  the 
observability  or  detectability  of  the  control  system  (1.1). 

Asymptotic  nonlinear  filtering  problems  of  the  type  (1.4)  have  been  studied 
by  a  number  of  authors.  Picard  [51],  Bensoussan  [4],  and  Ji  [31]  have  treated 
the  case  e  =  1  and  8  — >  0.  Using  a  suboptimal  filter  (similar  to  the  extended 
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Kalman  filter)  they  obtained  by  various  techniques  uniform  in  time  estimates 
of  the  error.  Rather  strong  assumptions  were  made  regarding  the  function  h , 
stronger  than  observability  or  detectability.  The  suboptimal  filter  has  a  1/8 
singularity,  making  it  difficult  to  identify  any  limiting  filter.  In  any  case,  the 
limit  would  not  be  deterministic. 

Hijab  [25]  treated  the  case  e  —  6  — >  0,  and  obtained  a  WKB  expansion 
of  pathwise  unnormalised  conditional  measures  under  certain  smoothness  as¬ 
sumptions.  In  [26],  Hijab  derived  a  large  deviation  principle  for  the  pathwise 
conditional  measures.  His  work  in  part  provided  the  motivation  for  the  asymp¬ 
totic  filtering  method  for  observer  design,  since  it  identifies  the  limilting  filter. 
Mitter  [47]  also  posed  this  asymptotic  filtering  problem.  This  is  the  case  stud¬ 
ied  in  detail  in  this  dissertation.  We  shall  make  some  comments  on  the  other 
cases  in  Section  7.3.  We  give  a  new  and  detailed  analysis  of  this  problem  using 
both  PDE  and  probabilistic  techniques.  A  logarithmic  limit  result  for  the  un¬ 
normalised  conditional  densities  is  obtained,  and  a  large  deviation  principle  “in 
probability”  is  established  for  the  conditional  measures.  We  also  study  the  weak 
limit  of  these  conditional  measures  and  of  the  nonlinear  filters. 

We  now  indicate  how  our  analysis  is  carried  out.  To  simplify  the  discussion, 
consider  now 


i(t)  =  f{x{t)),  *(0)  =  x0,  (1.5) 

y(t)  =  h(x(t)),  ?/(0)  =  0, 

with  the  associated  noisy  system 

clxe(t)  =  f(x€(t))dt  +  \Zedw(t),  af(0)  =  xc0,  (1-6) 

dyc(t )  =  h(xc(t))dt  +  \/edv(t),  i/(0)  =  0. 
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(1.7) 


The  Zakai  equation  for  an  unnormalised  conditional  density  qe(x,t)  is 

dqc  =  A*qcdt  4 — hqcdye , 
qe{x,  0)  =  Ctexp(-S0(x)/e), 

where  Ae  is  the  generator  of  the  Markov  process  x\. 

Formally  applying  the  logarithmic  transformation 

Wc(x,t)  =  -e\ogqc(x,t), 

we  obtain  a  nonlinear  parabolic  PDE  for  Wc.  Sending  e  — >  0,  then  ?/e  — >  r/  and 
Wc  — v  W,  where  W  satisfies  (in  the  viscosity  sense)  the  Hamilton- Jacobi  (HJ) 
equation 

§-tw  +  S  { DW  ■  (/(*) +  u)  “  Q I u  I2  +\h(x)2  ~  vCOM*)) }  =  °> 

IH(M)  =  So(x).  (1.8) 

Using  the  vanishing  viscosity  method  developed  by  Crandall-Lions  [6]  and  Evans- 
Ishii  [16],  we  obtain 

q€{x,t)  =  exp  ^  [W(x,t)  4-  o(l)]^ 

in  probability  as  e  — »  0,  uniformly  on  compact  subsets  of  JRn  x  [0,2’].  This  is 
proved  rigorously  via  the  “robust”  Zakai  equation. 

The  HJ  equation  (1.8)  is  parameterised  by  the  observation  path  ?/(•),  and 
characterises  the  deterministic  limiting  filter  since  it  can  be  interpreted  as  the 
Bellman  equation  for  an  optimal  control  problem  arising  in  Mortensen’s  deter¬ 
ministic  minimum  energy  estimation.  Consider  the  auxilliary  control  system 

x  =  f(x)+u,  (1.9) 

V  =  h(x)  +  v, 


4 


where  u ,  v  are  deterministic  functions  modelling  noise.  Viewing  u  as  a  control 
and  fixing  y,  we  have 

W(x,t)  =  (1.10) 

inf„(.)  {So(z«(0))  +  fg  \  |  u(s)  |2  +  |h(:ru(.s))2  -  y(s)h(xu(s))ds  :  xu(t)  =  a;} 

Noting  that  (h(x)-y(t))2  =  h(x)2  —  2y(t)h(x)  +  y(t)2,  we  see  that  inf^eR**  W(x,t) 
is  a  measure  of  the  minimum  energy  required  for  (1.9)  to  produce  the  given 
output  y(s)  :  0  <  s  <  t.  If  W(-,t)  has  a  unique  minimiser  x(t),  we  call  x(t)  the 
deterministic  estimate  given  y(s)  :  0  <  s  <  t.  This  defines  a  deterministic  filter 
7ft [?/(•)].  In  fact,  we  show  that 

as  e  — ■>  0,  where  7fte  is  the  stochastic  filter  defined  pathwise. 

Turning  now  to  the  second  approach,  the  Kallianpur-Striebel  formula  gives 
a  representation  for  the  unnormalised  conditional  measures  c\\ 

*1(4)  =  E^x^exp{~\[\J0  Kx\Yds- Jo  h(xl)dyl  )  (i-n) 

for  all  <f>  €  Cb(JRn).  Using  an  extension  of  a  theorem  of  Varadhan,  we  obtain  the 
large  deviation  type  estimate 

*t(A)  x  exP  (~^inf  IU(a;,t)) 

in  probability  as  e  — i  0,  for  Borel  sets  A  C  Mn.  The  symbol  “x”  denotes 
logarithmic  equivalence.  This  result  allows  us  to  prove  a  large  deviation  principle 
(in  a  generalised  sense)  for  the  conditional  measures  =  cr^/a^(lRn),  and  to 
show  that 

7ft  6x(t) 
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ill  probability  as  e  — >  0,  where  x(-)  is  the  deterministic  trajectory. 

We  also  obtain  a  large  deviation  principle  for  the  distributions  of  the  condi¬ 
tional  measures  xf  This  is  a  second  level  of  large  deviation  behaviour  for  our 
asymptotic  filtering  problem. 

We  can  explain  formally  why  the  deterministic  estimator  arises  naturally  in 
the  context  of  large  deviations  as  follows.  In  the  theory  of  large  deviations, 
asymptotic  probabilities  are  characterised  in  terms  of  variational  problems.  The 
Wentzell-Freidlin  theory  [20]  considers  small  random  perturbations  of  dynamical 
systems  (see  Section  2.2.2).  If  Pf  is  the  distribution  of  Xc  =  {ah(,s)  :  0  <  s  <  T} 
on  Q"  =  C([0,  T],  IRn),  then  one  has 

Pex{A)  x  e-7inf*eA/*(0) 

for  Borel  sets  A  C  fln,  where  the  action  function  lx  is  given  by 

1  T 

lx(0 )  =  SO(0O)  +  -  [  |  0S-  f(0s )  |2  ds 

J  0 

for  absolutely  continuous  0  £  fln,  and  infinity  otherwise.  Thus,  heuristically, 

Pex(d0 )  X 

Now  let  Py\x  denote  the  conditional  distribution  of  Ye  =  {ye(-s)  :  0  <  s  <  T} 
given  Xt  on  Oo  =  {r/  £  C([0,T],  1R )  :  77(0)  =  0).  Then  we  have 

Pf\x(drj  |  0)  x  e~^foT^-h^'>\2ds^dri\ 

We  can  compute  P)C\Yi  the  conditional  probability  of  Xe  given  Yc  on  fln  using 
Bayes’  rule: 

Pk[r(A  I  V) 

_  Ja  exP  (  2e  fo  1  Vs  ~  K9»)  I2  ds)  Px(d6) 
inn  exp  (~yJo  I  Vs  -  h{0s)  |2  ds)  P^(dO) 
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I A exP  ( 

1 

<  € 

So(0o)  +  fol\(>s-  f(0s)  |2  +lh(9sy  -  psh{ds)ds 

)  sur 

Inn  exp 

H 

[s^o)  +  lo  l  1  0*  -  SiPa)  I2  A\h{Bsy  ~  1 )sh(ds)ds 

) 

\  “ dd ” 

The  expression  in  the  exponent  is  just  the  quantity  being  minimised  in  (1.10), 
and  the  above  implies  that  the  measures  Px\y  concentrate  on  the  optimal  tra¬ 
jectory  as  e  — »  0. 

Finally,  we  turn  to  our  observer  designs.  In  general,  we  cannot  obtain  the 
deterministic  estimate  x(t)  from  a  finite  set  of  ODEs,  so  we  must  regard  (1.8) 
as  giving  the  (infinite  dimensional)  dynamics  of  the  limiting  filter.  One  notable 
exception  is  the  Benes  class  of  systems,  which  admit  finite  dimensional  filters. 
An  exact  infinite  dimensional  observer  is  obtained  by  setting  So  =  0  in  (1.8).  If 
we  assume  that  (1.5)  is  observable  on  [0,  T],  then  x(t)  =  x(t)  for  all  f  >  0  and 
all  x0  €  JRn. 

We  obtain  a  finite  dimensional  observer  design  in  the  spirit  of  extended 
Kalman  filtering.  We  use  an  approximate  deterministic  filter  which  is  defined 
by  an  ODE  and  a  Riccati  equation  as  the  basis  for  our  observer  design: 

m(t)  =  f(m(t))  +  Q(t)H(m(t))'(i/(t)  —  h(m(t))), 

Q(t)  =  A(m(t))Q(t)  +  Q(t)A(m(t))'-Q(t)H(m(t)yH(m(t))Q(t)  +  I 

where  A{x)  =  Df(x),  H(x)  =  Dh(x).  This  approach  was  initiated  by  A. 
Bensoussan  in  [2]  for  the  case  h(x)  —  Cx. 

If  the  nonlinearities  are  small,  then  we  expect  that  x(t)  ~  m(t).  The  large 
time  behaviour  of  the  error  x(t)  —  m(t)  depends  critically  on  the  initial  conditions 
Xo,  m0  and  on  the  growth  of  Q(t).  Under  a  uniform  detectability  condition, 
which  is  similar  to  the  well  known  detectability  condition  for  linear  systems,  one 
can  bound  ||  Q(t)  ||  uniformly.  Then  if  |  x0  —  m0  |<  p ,  where  p  depends  on 
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the  nonlinearities  and  design  parameters,  the  error  decays  exponentially  to  zero. 
The  Lyapunov  function  x'Qit^x  is  used  to  prove  this. 
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Chapter  2 

Probabilistic  Framework 

Some  basic  results  from  the  theories  of  nonlinear  filtering  and  large  deviations 
are  recalled  below.  There  is  now  a  substantial  literature  devoted  to  these  sub¬ 
jects,  and  in  this  chapter  we  present  briefly  the  framework  required  for  treating 
our  asymptotic  nonlinear  filtering  problem.  In  addition,  we  prove  some  auxilliary 
results  for  subsequent  use. 

2.1  The  Filtering  Problem 

Let  P)  be  a  complete  probability  space  with  a  filtration  {Ft},  on  which 

are  defined  two  independent  standard  Wiener  processes  {wt,  Ftj,  {t>t,  Ft}  taking 
values  in  ]Rn  and  IR  respectively.  Defined  also  is  an  TRn  valued  random  variable 
xe0  independent  of  the  Wiener  processes  with  distribution  pjp  which  is  assumed 
to  have  a  density 

pSM  =  C<e-"s"w  (2.1) 

where  is  a  normalisation  constant,  So  is  Lipshitz  continuous,  convex,  and 
S0(x0)  =  0,  S0(x)  >  0  if  x  xQ  G  lRn. 

We  consider  a  pair  of  stochastic  differential  equations  (SDE) 

dxe(t)  =  f(xe(t))dt  +  y/edw(t),  xe(0)  =  xc0,  (2.2) 

dye(t)  =  h(xc(t))dt  +  y/ldv(t),  i/e(0)  =  0.  (2.3) 

Here  we  assume  /  G  Cl(IRn ,  IRn)  and  h  G  C{j(lRn,IR).  Thus  there  exists  a 
unique  solution  of  (2.2),  (2.3)  in  the  strong  sense  on  (H,^7,  P).  Then  {x\}  is  a 
Markov  process  with  generator 

Ae  =  |A  +  f(x)  •  D, 
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where  A  is  the  Laplacian  and  D  denotes  gradient  in  the  x  variable. 

The  process 

XI  =  {rr£(s)  :  0  <  s  <  t} 

is  called  the  signal  or  state  process,  and  cannot  be  observed  directly.  Rather,  we 
have  available  the  observation  process 

v;  =  {</M  :  0<s<(}, 

which  consists  of  a  function  of  the  state  plus  noise.  The  filtering  problem  is 
concerned  with  estimating  a  functional  of  the  state  process  using  the  information 
contained  in  the  observation  process.  This  information  is  encoded  in  the  filtration 
{T* }  defined  by 

Yt  =  *07). 

the  cr-algebra  generated  by  the  observations  up  to  time  t. 

Let  <f>  €  Bb{lRn).  The  “best”  estimate  of  given  3^  is  the  conditional 

expectation 

=  mo  i  xi- 

We  use  the  notation  x\  for  the  conditional  mean  E[x\  \  3^]-  The  task  of  filtering 
theory  is  to  obtain  representations  and  recursive  equations  for  computing  quan¬ 
tities  such  as  7Tj(^).  We  refer  the  reader  to  the  books  by  Elliott  [13],  Kallianpur 
[34],  and  Liptser  and  Shiryayev  [43]  for  further  details. 

One  can  study  filtering  problems  in  more  general  settings.  We  choose  not  to 
do  so  here,  prefering  to  focus  on  the  central  ideas.  The  methods  used  in  this 
thesis  can  be  extended  if  desired. 
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2.1.1  Kallianpur-Striebel  Formula 

Fix  T  >  0.  For  t  6  [0,  T]  define 

nr  =  c([o,t],mn),  sio,t  =  {veC([o,t],m):r,(o)  =  o}t 

and  equip  these  spaces  with  the  uniform  topology.  We  shall  always  use  ||  •  || 
to  denote  the  uniform  norm.  For  t  =  T  we  write  simply  Sln  =  etc.  This 
convention  is  used  in  other  notation  below  (c.f.  Liptser  and  Shiryayev  [43], 
Chapter  7).  £?(X)  denotes  the  Borel  er-algebra  of  a  topological  space  X. 

If  (X,  d)  is  a  metric  space,  then  ('P(X),  g)  is  the  space  of  probability  measures 
on  (X,  B(X))  where  g  is  the  Prohorov  metric  (Ethier  and  Kurtz  [15],  Chapter  3): 

q(vi,  v2)  =  inf  {<?>  >  0  :  V\(C)  <  v2(Cs )  +  6  for  all  closed  C  C  Xj 

for  zq,  v>2  €  'P(X),  where 

C6  —  |x  €  X  :  inf  d(x,  y)  <  . 

If  (X,d)  is  complete  and  separable,  then  so  is  ('P(X),^),  and  convergence  in 
('P(X),  g)  is  equivalent  to  weak  convergence: 

=>  v  as  i  — *  oo 


if  and  only  if  for  all  p>  £  (7f,(X) 


lim  /  pdvi 
i-  oo  Jx 


The  random  variables  X[,  Yte  defined  above  take  values  in  O']1,  Sl0,t  respec¬ 
tively.  Define 


Pex(A)  =  P(Xe  e  A)  {AeB(Sln)). 
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The  following  theorem  gives  a  functional  integral  representation  for  a  condi¬ 
tional  expectation,  and  is  known  as  the  Kallianpur-Striebel  formula.  Refer  to 
Kallianpur  and  Striebel  [33],  Kallianpur  [34]  and  Liptser  and  Shiryayev  [43]. 


Theorem  ( Kallianpur-Striebel  Formula  )  Let  $  £  M) .  Then 


£[$m  |  y](u) 


fQnA<(0,Y^))P^d0) 


(2.4) 


where 


A 1(0, rj)  =  exp  7^  Jo  h(0s)2ds-  h(Os)dr)s  ^  (0  £  ftn,  T)  £  Ll0)  (2.5) 


We  use  this  formula  to  define  several  random  measures  as  follows.  Define 


n'(A)M 


/  A‘(0,y»)Pi(c»)  (A  6  B(fi”),  u>  6  SI), 
E<(fl")(w)' 


(2.6) 

(2.7) 


Thus  IT  is  a  regular  version  of  the  conditional  probability: 


P(XC  £  A  |  T£)H  =  IT(A)H  P-a.s.,  (A  £  B(Qn)). 


We  are  also  interested  in  corresponding  conditional  measures  on  IRn .  Let 
t  £  [0,T]  and  define 


<(A)(u>) 


£e({0  :  0t  6  A})(w)  (A  £  B(lRn),  weft),  (2.8) 


So  7re  is  a  regular  version  of  the  conditional  probability: 


P{x\£  A  I  yi)  (w)  =  <(A)(w)  P-a.s. 
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(A  e  B(Mn)), 


and  if  (j>  G  Bb(Mn)  we  have 


<m^)  =  EP[<f>(xl)  \  yt](u)  (2.10) 

jg  m)K(8,y‘M)Px(de) 

Sa„K((),Y‘(u))PHJ0) 

The  measures  (£e,  IP,  irct  are  (unnormalised)  conditional  measures. 

7re  is  a  random  measure : 


<  :  0  ->■  V(lRn ) 

LO  l— >  7Tj(o;). 


(2.11) 


Pathwise  or  robust  analogues  of  these  measures,  which  we  next  define,  are 
useful  from  a  practical  point  of  view,  as  well  as  for  asymptotic  analysis  (Hijab 
[25],  [26],  and  Chapter  5  herein).  Refer  to  Davis  [11],  Sussmann  [53],  and  others. 

Using  the  integration  by  parts  formula  and  Girsanov’s  theorem,  we  proceed 
as  follows: 


/  *(*)A t(o,Yt(u>))P'x(<w)  = 

Jun 

f  $(A"£(<n))exp  [  h(x\(Co))2 ds  —  f  h(xl(u))dyl(u>) 

JQ  \  6  j  2  JO  Jo 


P(dcb) 


=  J^{Xc(Co))exp^-~  -^(w)l((4(w))  +  Ve(xcs(u),yl(u))ds 
rT(Xt(u}),Yc(Lo),Q)P(dCb)  P-a.s. 

=  m^(0,Y^))P(YHuj)](dd),  (2.12) 


where 

Ve(x,  y)  =  ^h(x)2  +  yAth{x)  -  ^y2  |  Dh(x)  |2  (x  G  JR",  y  G  1R),  (2.13) 


r  1(0, y,u) 


exp 


-^/eL  VsDhiOs^xUsiu)  -  -^Jo  vl  I  Dh(0s)  |2  ds 


L 

{0  GO",  y  G  O0,  u>  G  O), 


(2.14) 
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(0,  rj)  —  exp  ^ —  — ?fifi(0t)  +  JQ  Vt{Osirls)ds  ^  (0  G  ft”,  r]  E  Do)-  (2.15) 


A? 


Here  P^  is  the  distribution  on  Hn  of  the  diffusion 


dx£(f)  =  g(xc(t),y(t))dt  +  y/edw(t),  P(0) 


Lc» 


where 

g(x,y)  =  f(x)-yDh(x)'  (x  E  JRn,y  E  IR).  (2.16) 

Notice  that  A£  does  not  involve  stochastic  integration,  and  thus  is  well  defined 
for  all  r]  E  H0,  and  not  just  on  a  set  of  e-Wiener  measure  one.  Further,  A£ 
depends  continuously  on  y  £  fi0.  These  properties  are  inherited  by  the  measures 
we  next  define: 


2e(A)M 

ne(A)M 


{A  6  B(fT),  r,  e  f!o). 


and  also 


^t(A)[v] 


E£  ({0  :  0t  E  A})  [y]  (A  E  B(IRn),  y  G  ft0), 

^£(A)W 

crf(lRn)[r/]' 


In  contrast  to  (2.11),  this  can  be  interpreted  as  a  pathwise  filter  7t£: 


(2.17) 

(2.18) 


(2.19) 

(2.20) 


tt£  :  O0,t  -4  V(Mn) 
v  >-»  7ft  M 


(2.21) 


These  measures  also  define  versions  of  the  above  conditional  probabilities  via 
E(H)£H  =  E£(A)[F£(u,)]  P-a.s.  (A  G  B(Qn),  cu  G  O),  (2.22) 


and  in  particular  we  have 


/«.  W)A'(^.y,G))AyM|(‘i<l) 


P-a.s. 


(2.23) 
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We  now  prove  that  these  robust  measures  are  continuous  functions  of  r/  € 
flo-  This  has  been  established  by  several  authors;  for  example  Kushner  [41], 
Sussmann  [53]. 

Proposition  2.1  The  map  defined  by  (2.20),  (2.21)  is  locally  Lipshitz  con¬ 
tinuous. 


Proof:  1.  Let  K  C  fio  be  compact,  and  rj,(  6  K.  We  will  show  that  there 
exists  a  constant  Ck,c  >  0  such  that 


eKfo]»<[C 3)  <  cK,e\\v-( 


(2.24) 


2.  For  any  M  >  0,  there  exists  C  >  0  such  that 


J(M)  —  sup  Eexp( - -=  f  r}sDh(xcs)dws )  <  e  <= .  (2.25) 

IWI<m  V  VeJo  J 


We  have 


/  Vs  I  Dh(x\)  | 2  ds  <  [  ||  ? 7  ||2||  ||2  ds  <  C. 

Jo  Jo 

Then  by  Lemma  7.1.2  of  Kallianpur  [34], 


1  —  E  exp  (  — 


1 


e  L 


[  r/sDh{xts)d.wa  -  i  f  pi  \  Dh(x\)  |2  ds 
o  Z  Jo 


e 

i  r* 


>  e  €  E  exp  ( - p  /  rjsDh(xcs)dws  , 

\/t  Jo  J 


which  implies  (2.25). 

3.  Let  F  C  JRn  be  closed,  and  set  M  —  snpri&K  ||  77  ||.  Following  the  method  of 
Sussmann  [53],  we  shall  prove  that  there  exists  C  >  0  depending  on  K,  e  but 
not  on  F  such  that 


*,e(F)M  -  s;(f)[ci  1  <  c  11 ,  -  <  n . 
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Note  that  we  can  write 


<r;(f)fo]  =  EL,€Fex p  (^G(X‘,ri)j 

where 

G(Xe,rj )  =  r)th(xet )  -  /  ^h(x*)2  +  VsAth(xcs)ds  -  \ft  f  r]sDh(xl)dws. 

Jo  l  Jo 

Then 


I  a;(F)[i;)  -  s;(F)[c]  |<  E  (ciatx‘^  -  etc(-*v)) 

Eli  exp  (iG(X',M)  +  *G(X',(1  -  A)0)  d\i  (G(X‘,V)  -  G(X',Q) . 


Define 


rt  _  ft 

F(Xc,-q)  =  r)th(x\)  -  /  T]sA£h(xes)ds  -  x/e  /  rjsDh(xes)dws. 

Jo  Jo 

Note  that  for  7  >  0, 


x  |  <  7Cosh(— ). 

7 


Using  this  inequality  with  7  =||  7  —  C  j| ,  we  get 


vKFM  ~  vt(F){(}  |  < 


h  -  C II 


2 

< 

exp 


£ 


I1  _i 

/  e 

Jo 


~7  f0‘ HO2  ds 


U(Af‘,A,  +  (l-A)C)-||^U| 


+  exp 

<  II  V  ~  C  II  e^J{M  +  1) 


dA 


as  required. 

4.  Next  we  have 


7T, 


(F)M-^(F)[C]|  < 


_ 1 _ 

a-f(iR”)[7]af(iRn)[C] 


<  II  7  -  C  II  • 
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5.  Let  6  =  CK,c  ||  rj  —  (  ||  .  Then 

<  *,V‘)[C]  +  «, 

which  implies  (2.24).  | 

2.1.2  Zakai  Equation 

Consider  the  unnormalised  conditional  measures  aet  defined  by  (2.8).  Define  a 
new  measure  P0  on  (Cl,  T)  by 

^  =  A  ’T(X‘ ,¥<)-'■ 

Then,  under  Pq, 

m  = 

is  a  standard  Wiener  process  with  respect  to  the  filtration  {y\ }. 

Under  our  assumptions,  <y\  has  a  density: 

o\(A)  =  [  qe(x,t)dx  ( AeB(Rn )). 

Ja 

The  next  theorem  gives  (linear)  stochastic  partial  differential  equations  sat¬ 
isfied  by  (7j  and  qc.  These  are  the  Zakai  equations  of  nonlinear  filtering,  in 
weak  and  strong  form.  See,  for  example,  Elliott  [13],  Liptser  and  Shiryayev  [43], 
Kallianpur  [34],  etc. 

Theorem  (Zakai  Equation )  Let  <j>  €  Cl(Mn).  We  have 

<r\(<l>)  -  Ho (<f>)  +  I  cres(Ae(j))ds  +  -  f  ecs(<f>h)dyl  P- a.s.,  (2.26) 

Jo  e  Jo 
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and 


dqc(x,t )  =  A*qe(x,t)dt -\ — h(x)qe(x,t)dyc(t)  P- a.s.,  (2.27) 

9e(z,0)  =  Po(x)- 


The  Zakai  equation  can  be  regarded  as  part  of  an  infinite  dimensional  reali¬ 
sation  of  the  stochastic  filter  7re  (2.11),  as  discussed  in  Hijab  [25]. 

From  Section  2.1.1,  we  have  a  Feynman-Kac  type  representation  for  the 
solution  of  the  SPDE  (2.26): 

o*M)=  /  M)A l(6,Y<)P<x(dO)  P- a.s.  (2.28) 

Jnn 

A  similar  representation  can  be  obtained  for  the  solution  of  (2.27)  using  the 
robust  Zakai  equation  derived  below.  As  noted  in  the  previous  section,  such 
pathwise  versions  will  be  useful  in  the  sequel. 

Define 

pe(x,t)(uj)  =  exp  q\x,t){u).  (2.29) 

Using  Ito’s  rule,  p€  satisfies 

§-tp€(x,t)(u)  -  ^Apc(x,t)(uj)  +  g(x,y't(w))  ■  Dpe(x,t)(u)  (2.30) 

+  +  edivg(x,yet(u)))  p*(x,t)  =  0, 

p£(x,0)(u;)  =  Po{x),  P- a.s. 

and  note  that  this  equation  does  not  involve  stochastic  integration.  It  is  a  PDE 
with  random  coefficients  since  it  depends  on  lo  G  Cl  via  Yf.  Thus  fix  ?/  G  flo  and 
denote  by  pe(x,t)  =  p€(x,t)[q]  G  Cl'2{IRn  x  [0,T])  the  unique  positive  solution 


18 


of 


§-tpe(x,t)-^Ap€(x,t)+g(x,r]t)-Dpe(x,t)+-(Ve(x,r)t)  +  ediv</(x,  pt))  pe(x,  t)  =  0, 


(2.31) 


pc{x,0)  =  pc0(x), 


We  refer  to  (2.31)  as  the  robust  or  pathwise  Zakai  equation,  and  it  forms  part  of 
an  infinite  dimensional  realisation  of  the  pathwise  filter  7fe  (2.21).  This  defines 
a  version  of  the  solutions  to  (2.27),  (2.30)  via 


pe(x,t)(u})  =  pc(x,  t)[F£(o>)j  P-a.s.  (2.32) 

Let  Px,t,[n]  denote  the  distribution  on  fl"  of  the  diffusion: 

dxe(s )  =  —  g(xc(s),T](t  —  s))ds  +  \/edw(s),  0  <  s  <  t, 
xe(0)  =  x 

Then  using  the  Feynman-Kac  formula  (see  Freidlin  [21],  Theorem  2.2,  page  132), 
we  have 


pe(x,t)[r)]=  (2.33) 

Inn  pco(0t)  exp  (-J  [/o  (Ve(0s,  rj(t  -  5))  +  edi vg(9s,  1 j(t  -  3)))  ds\ )  PXftM\{d9). 


This  together  with  (2.32),  (2.29)  yields  a  functional  integral  representation  for 
qc  P-a.s.  Note  also: 


7T 


mn 


fRn  (f>(x')e1ty‘(“P(Ppe(x,t)[Yc(u)\dx 
fR n  t)[Y€((jj)]dx 


P-a.s. 


(2.34) 


We  now  give  a  bound  and  continuity  result  for 


ie  fnl 


Proposition  2.2  Let  K  C  be  compact.  Then  there  exists  a  positive  constant 
C  such  that 


19 


for  all  r/  £  K  and  all  (x,t)  €  !Rn  x  [0,T],  and 


sup  |  Pc(x,t)[r]]  -p*(x,t)[(]  |  <  C||t/-C|| 

(a;,7)£finx[0,T] 

for  all  77,  (  6  K . 

Proof:  To  obtain  the  bound,  use  the  representation  formula  (2.33)  and  reduce 
to  a  gaussian  integral  with  Girsanov’s  Theorem.  Then  bound  this  integral. 

The  proof  of  the  continuity  result  is  similar  to  that  of  Proposition  2.1  and 
will  be  omitted.  | 


2.2  Large  Deviations 

Let  (X,  d)  be  a  complete  separable  metric  space,  equipped  with  the  Borel  a- 
algebra  Z?(X).  For  e  >  0  let  {P£}  be  a  family  of  probability  measures  on  X  with 
P£  =*  <5xo  as  e  — »  0,  for  some  x0  €  X  (weak  convergence  of  probability  measures). 
Large  deviations  is  concerned  with  estimating  the  rate  at  which  P£(/l)  — >  0  as 
e  — >  0  for  events  A  with  Xo  ^  A\  that  is,  to  study  asymptotic  probabilities  of 
“rare”  events.  These  asymptotics  are  characterised  by  a  deterministic  variational 
problem:  minimise  an  action  or  rate  function  over  the  set  A. 

A  general  formulation  for  studying  such  problems  was  given  by  Varadhan  [54], 
[55],  and  the  measures  {P£}  are  then  said  to  satify  a  large  deviation  principle 
(LDP).  Freidlin  and  Wentzell  [56],  [20]  have  studied  extensively  problems  con¬ 
cerned  with  small  random  perturbations  of  dynamical  systems.  There  are  close 
links  to  PDE  problems  with  small  parameters;  see  for  example  Varadhan  [55], 
Freidlin  and  Wentzell  [20],  Freidlin  [21],  and  the  references  contained  therein. 

A  large  deviation  principle  for  a  family  {P£}  can  be  obtained  using  purely 
probabilistic  methods  (for  example,  Ellis  [14],  who  treats  statistical  mechanics 
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problems  using  large  deviations).  Sometimes  the  problem  can  be  expressed  as 
an  equivalent  PDE  problem,  which  may  be  solved  using  probabilistic  methods 
via  a  representation  formula  (for  example,  Freidlin  [21]),  or  stochastic  control 
techniques  (Fleming  [17]),  or  by  using  direct  analytic  (PDE)  arguements  (for 
example,  Evans  and  Ishii  [16]). 

Our  interest  is  in  the  small  parameter  asymptotics  of  the  filtering  problem, 
that  is,  in  large  deviation  results  for  conditional  measures  and  related  asymp¬ 
totics  of  the  filtering  equations.  Below  we  define  a  LDP  for  families  of  random 
measures. 


2.2.1  Varadhan’s  Formulation 

Let  X  be  defined  as  above,  and  let  {Pc}  be  a  family  of  probability  measures  on 
X. 


Definition  (Varadhan  [54] )  We  say  that  {P£}  obeys  the  large  deviation  principle 
(LDP)  with  action  function  I  :  X  — >  [0,oo]  provided: 

(i)  I  is  lower  semicontinuous; 

(ii)  For  each  M  >  0,  the  set  {x  6  X  :  7(x)  <  M }  is  compact; 

(iii)  For  any  closed  set  C  C  X, 


limsupelog  Pe(C)  < 


-  inf  7(x); 


(iv)  For  any  open  set  GcX, 


liminfelogPTG')  >  -  inf  7(x). 

V  ’  —  xeG  v  ' 


Typically,  7(x0)  =  0  and  7(x)  >  0  if  x  ^  x0. 
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EXAMPLE  Let  X  =  lRn ,  and  consider  the  random  variables  xe0  with  distribution 
fj,c0  and  density  pe0  defined  by  (2.1).  Then  for  regular  A  E  B{IRn)  one  has 

lirn/4(^)  =  -  inf  Sofa). 

c — ►U  x£A 

So  if  Xo  $  A  the  decay  rate  is  given  by 

Mo  (A)  x  e^infl^soWj 

as  e  — >  0.  The  symbol  “x”  denotes  logarithmic  equivalence  in  the  sense  of  (iii) 
and  (iv)  of  the  above  definition.  In  terms  of  densities, 

lime  log  Pq(x)  =  -S0(x). 

e— >0 

Here  the  action  function  is  I(x)  =  5o(a;).  Ill 

There  are  two  important  theorems  for  obtaining  new  large  deviation  results 
from  old  ones.  Each  has  a  number  of  variants. 


Theorem  ( Contraction  Principle,  Varadhan  [54]  )  Let  {Pe}  obey  the  LDP  with 
action  function  I .  Let  Ft  :  X  — ►  Y  be  continuous,  where  Y  is  another  complete 
separable  metric  space.  Assume  lime_,o  Fc  =  F  exists  uniformly  on  compact 
subsets  ofX.  Define  {Qe}  on  Y  by  Q£  =  PeF~1 .  Then  {Qe}  obey  the  LDP  with 
action  function 

=  inf 

x:F(x)=y 

If  y  is  not  in  the  range  of  F ,  then  set  J( y)  =  +oo. 


I(x). 


Theorem  ( Varadhan’s  Theorem  [54],  [55]  )  Let  {Pe}  obey  the  LDP  with  action 
function  I.  Let  Fc,  F  :  X  — >  IR  be  bounded  continuous  functions  such  that 
Fe  — >  F  uniformly  as  e  0.  Then  for  any  closed  set  C  C  X  and  any  open  set 
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G  C  X,  we  have: 


lim  sup  e  log  /  exp 
£ — >0  Jc 

lim  inf  e  log  /  exp 
c— o  Jo 


1 


PW  <  -inf[/(x)  +  F(x)] 


L  /(x)jP£(^x)  >  -inf[/(x)  +  F(x)]. 


The  second  result  is  an  extension  to  function  space  of  Laplace’s  asymptotic 
method.  The  next  result  is  a  version  of  Laplace’s  method  (c.f.  Freidlin  and 
Wentzell  [20],  Chapter  3). 


Theorem  2.1  (Laplace’s  Asymptotic  Method)  Let  F  :  lRn  — >  IR  be  Borel 
measurable,  bounded  below.  Assume  that  for  each  t  >  0, 


a(e)  =  /  e  ^^dx  <  co. 

Jnn 

(i)  V 

lim  sup  clog  a  (e)  <  M 
£— >0 

for  some  constant  M  <  oo,  then  for  any  closed  subset  C  of  ]Rn ,  we  have 
limsupelog  f  e~'F^dx  <  —  inf  F(x). 

e— ,0  JC  ~  xtC  V  ' 

(ii)  If  F  is  upper  semicontinuous,  then  for  any  open  subset  G  of  IRn ,  we  have 

lim  inf  e  log  f  e~‘F^dx  >  —  inf  F(x). 

£ — *0  &  JG  ~  xdG  ' 


Proof:  Upper  bound.  Write  m  =  infxeC  F(x)  and  assume  m  <  oo.  Let 

0  <  A  <  1.  Then 


/  e-i F^ix  =  I 
Jc  Jc 
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< 

< 


and  so 

elog  /  e~^xUx  <  -(1-  \)m  +  \(j)\oga(^). 

JC  A  A 

Therefore 

lim sup  elog  /  e~^F^dx  <  —  (1  —  A)m  +  AM. 
e— *0  dc 

Let  A  — *  0  to  obtain  the  required  inequality. 

Lower  bound.  Now  write  m  =  inf xeaF(x),  and  assume  m  <  oo.  For  any 
8  >  0  define 

Gs  =  {x  £  G  :  F(x)  <  m  +  8,  \  x  \<  R} 


where  R  >  0  is  chosen  large  enough  to  ensure  G$  ^  0.  Then  Gg  is  a  bounded 
open  subset  of  G ,  and 


/  e-l‘F(x)dx  >  [  e-^m+sUx 

Jg  Jg6 

>  Ln{Gs)e-^m+s\ 


and  hence 

lim  inf  e  log  J  e~lF^dx  >  —(m  +  8). 

This  holds  for  all  8  >  0,  hence  the  lower  bound  follows.  | 


2.2.2  Wentzell-Freidlin  Theory 

The  Wentzell-Freidlin  theory  is  concerned  with  small  random  perturbations  of 
the  dynamical  system 

x(s)  =  f{x(s)),  x(0)  =  *o,  (2.35) 
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for  5  £  [0,  T],  given  by,  for  example,  the  SDE  (2.2)  with  small  parameter  e  >  0. 
The  parameter  e  is  a  measure  of  the  intensity  of  the  noise,  and  as  e  — *  0,  the 
trajectories  of  (2.2)  are  “close”  to  the  trajectory  of  (2.35)  with  high  probability. 
However,  there  is  a  positive  (exponentially  small)  probability  that  the  trajecto¬ 
ries  of  (2.2)  will  be  far  from  the  trajectory  of  (2.35).  The  behaviour  of  such  rare 
events  is  captured  by  a  LDP. 

We  assume  that  x^  has  density  given  by  (2.1).  Noting  that  P(x\  £  A)  = 
fRnP{x\  £  A  |  xe0  =  x)fiQ(dx),  the  following  slight  extension  of  Theorem  1.2, 
page  45,  of  Freidlin  and  Wentzell  [20]  is  easily  obtained. 

Lemma  2.1  Under  the  above  assumptions  we  have: 

(i)  there  is  a  constant  C  >  0  depending  on  T  >  0  such  that 

sup  E  |  x\  —  xt  |2  <  eC\ 

0  <t<T 

(ii)  for  all  t  >  0  and  8  >  0, 

lim  P({  max  I  xc(s )  —  ads)  |>  <S})  =  0. 

e_ ,0  v  u<s<i  1  v  '  v  ' 


Assume  temporarily  that  xe0  =  x  a.s.  Let  denote  the  distribution  of  X€ 
on  nn.  Fix  T  >  0  and  consider  the  control  system 

xu(t)  =  f(xu(t))  +  u(t),  0  <  t  <  T,  (2.36) 

*(0)  =  x, 

where  u  :  [0,  T]  — >  IRa  is  measurable.  For  6  £  Qn  define 

Ix{0)  =  inf{i  f1  [  m(<)  I2  dt  :  xu  =  0,  x„(0)  =  x }  (2.37) 

u  L  Jo 
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2  So  I  S  ~~  /(^<)  |2  dt  if  0  is  absolutely  continuous 

and  d(0)  =  x, 

oo  otherwise 


Theorem  ( Freidlin  and  Wentzell  [20],  Theorem  1.1,  page  104)  { Pfx }  obey  the 
LDP  with  action  function  Ix(0). 

Now  suppose  again  that  Xq  has  distribution  pe0,  and  recall  that  Pf  is  the 
distribution  of  Xe  on  f ln: 

PUA)  =  /  Pl{A)Cte-^x)dx. 

J  Rn 

Define  for  0  €  ftn: 

Ix(0)  =  inf  Is'oWO))  +  \[  \  «(<)  I2  dt  :  =  0  j  .  (2.38) 


Theorem  2.2  {P£}  obey  the  LDP  with  action  function  Ix(Q)- 


Proof:  Upper  bound.  For  any  S  >  0,  from  the  previous  theorem  we  have,  for 


Ccfi"  closed, 


P‘(C)  <  exp(-i 


inf  1,(0)  -  e 


for  e  >  0  sufficiently  small.  Then 


Pe(C)  <  Cc  [  exp  — 
J  Rn  \  C 


dx 


and  hence 


limsup  e  log  Pe  (C)  < 

e — >0 


—  inf 

x  £Rn 


S0(x)  +  inf  h(0) 


+  6 


=  -Ix(0)  +  6, 
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using  Theorem  2.1.  Since  8  >  0  was  arbitrary,  the  upper  bound  follows. 

Lower  bound.  Note  that  if  G  C  fi"  is  open,  then 

x  t->  inf  Ix(0 ) 

OGG 

is  upper  semi  continuous.  The  lower  bound  follows  using 

PX(G)  >  exp(-^  + 

and  Theorem  2.1.  ^ 


Let  PxtY,  Py  denote  the  distributions  on  Vtn  x  fl0,  Do  of  (.X£,Te),  Ye  re¬ 
spectively.  Define  for  (0,?/)  G  Lln  x  O0, 

i.S'o(0o)  +  I  So  !  i  -  f(Os )  I2  +  I  r,s  -  h(Os)  |2  ds 
Ix,y{&,v)  =  ‘  if  (0,?/)  is  absolutely  continuous,  (2.39) 

oo  otherwise; 

and  for  7/  G  SIq, 


IY(ri)  =  mfnIjr,y(0,  t/). 


(2.40) 


Corollary  2.1  { Px,y I  an( ^  { Py  }  °bey  the  LDP  with  action  functions  Ix,y{0,v) 
and  Iy(i] )  respectively. 

2.2.3  Large  Deviations  for  Random  Measures 

As  before  let  (0,  JF,  P)  be  a  probability  space,  and  let  X  be  a  complete  separable 
metric  space.  We  are  interested  in  the  following  situation:  {Q€}  is  a  family  of 
random  probability  measures  on  X  with 

Q'  4  0xo 
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for  some  x0  €  X.  That  is,  the  probability  measures  Qe  converge  weakly  in 
probability  to  the  point  mass  at  Xo:  for  all  a  >  0, 

limP  (£>(Q£,6x0)  >  «)  =  0. 

£—>•0 

Let  now  {Q£}  be  a  family  of  random  probability  measures  on  X. 

Definition  2.1  We  say  that  {Q£}  obeys  the  large  deviation  principle  in 
probability  (LDPP)  with  action  function  J  :  X  — ►  [0,  oo]  provided: 

(i)  J  is  lower  semicontinuous; 

(ii)  For  each  M  >  0,  the  set  {x  (E  X  :  J(x)  <  M}  is  compact; 

(iii)  For  any  closed  set  CcX, 

limsupelogQ€((7)  <  -  inf  J(x)  in  probability; 

e— >0 

(iv)  For  any  open  set  G  CX, 

liminf  elog  Qe(Cr)  >  -  inf  J(x)  in  probability. 


Remark  This  definition  is  smiliar  to  a  definition  by  Ji  [31],  who  considered 
filtering  problems  where  a  small  parameter  appears  in  the  observation  equation 
only,  and  not  in  the  state  equation.  /// 
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Chapter  3 

The  Observer  Problem 


In  this  chapter  we  define  what  we  mean  by  the  term  observer.  The  defi¬ 
nition  given  is  rather  general,  in  as  much  as  the  observer  state  space  may  be 
infinite  dimensional,  as  is  typically  the  case  in  nonlinear  filtering.  We  discuss 
the  well  known  definitions  concerning  observability  in  the  light  of  our  definition, 
and  introduce  a  notion  of  uniform  detectability  which  extends  the  definition  of 
detectability  for  linear  systems. 

The  asymptotic  filtering  method  is  applied  to  detectable  linear  systems.  Fi¬ 
nally,  we  review  several  approaches  proposed  by  other  authors. 

3.1  Problem  Statement 

We  wish  to  design  an  observer  for  the  nonlinear  control  system 

x  =  f(x,u ),  x(0)  =  x0,  (3.1) 

y  =  h(x), 

with  state  x(t)  £  lRn,  control  u(t )  £  [-1,1]™,  and  observation  y(t )  £  TRP.  The 
initial  condition  Xq  £  Mn  is  unknown. 

The  problem  is  to  use  the  available  information  t  i— »  u(t )  and  t  y(t )  to 
compute  an  estimate  x(t)  of  the  state  x{t)  such  that  the  error  e{t)  =  x{t )  —  xit ) 
converges  to  zero  as  t  — *•  oo. 

In  general,  some  a  priori  knowledge  concerning  the  initial  condition  x0  is 
required.  We  quantify  this  by  saying  that  x0  belongs  to  some  class  T  which 
depends  on:  the  problem  data  /,  h\  the  initial  estimate  xq\  and  on  various 
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design  parameters.  For  example,  X  =  { x0  €  Mn  :  \  x0  -  x0  \<  p}.  It  is  desirable 
that  X  be  as  large  as  possible,  to  maximise  the  permissible  initial  uncertainty. 

Definition  3.1  An  observer  for  the  control  system  (3.1)  is  a  dynamical  system 

m  =  F(m,u,y),  m(0)  =  m0,  (3.2) 

x  —  G(m), 

such  that 

fim  |  x(t)  —  x(t)  |  =  0  (3.3) 

for  all  xq  G  I,  where  X  is  a  subset  of  lRn . 

Here,  m{t)  €  M  and  x(t )  G  ]Rn.  The  observer  state  space  M.  may  be  infinite 
dimensional  (c.f.  nonlinear  filtering).  Of  course,  observers  for  which  M.  =  Ml, 
for  some  /  <  oo,  are  most  useful. 


3.2  Observability  and  Detectability 

For  an  observer  design  to  be  successful,  the  measurements  t  — >  u(t)  and  t  — >  y(t) 
must  contain  “enough”  information  about  the  state  trajectory.  This  information, 
together  with  X ,  is  used  by  the  observer  to  estimate  the  state.  In  an  effort  to 
make  precise  what  “enough”  means,  in  this  section,  we  study  various  notions  of 
observability. 

Observability  addresses  the  question:  what  does  the  information  contained  in 
the  measurements  reveal  about  the  states?  More  precisely,  consider  the  control 
system  (3.1)  and  let  denote  the  flow  corresponding  to  a  control  u.  :  t 

u(t).  Let  Xq  denote  two  initial  conditions.  Can  the  states  and  be 

distinguished  by  comparing  their  observation  records  for  some  control  u? 
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Definition  The  control  system  (3.1)  is  observable  if  given  any  x l  ^  Xq  E  lRn 
there  exists  a  control  u  such  that 

h  (lu(t)xl')  ^  h(^u{t)xfj  (3.4) 

for  some  t  >  0.  If  (3.4)  holds  for  some  u,  we  say  that  the  states  x\  and  x%  are 
distinguishable  with  respect  to  the  control  u. 

Note  that  in  general,  p  <  n  and  h  :  Mn  — >  Mp  is  not  one  to  one:  h  is  often 
a  submersion.  Also,  the  control  enters  the  definition  in  an  essential  way.  There 
may  be  many  controls  for  which  Xq  and  Xq  are  indistinguishable. 

In  general,  it  is  difficult  to  give  criteria  for  testing  whether  or  not  a  given 
system  is  observable.  A  number  of  authors  have  addressed  this  issue,  most 
notably  in  the  pioneering  work  of  Hermann  and  Krener  [24].  See  also  Isidori 
[28]. 

There  is  an  algebraic  test  for  a  weak  form  of  local  observability.  Let  U  be  a 
neighbourhood  of  xQ  E  IRn.  Define 

I(x o,  U )  =  {x'o  £  :  Xg  is  not  distinguishable  from  xq  with 

respect  to  u  and  -fu{t)xo  £  U,  -fu(t)xl  E  U  for  all  t}. 


Definition  The  control  system  (3.1)  is  said  to  have  the  local  distinguishability 
property  if  every  xo  E  Mn  has  an  open  neighbourhood  V  such  that  for  all  open 
neighbourhoods  U  of  x o,  U  C  V,  one  has  I(xo,U)  fl  V  =  {^o}. 

Let 

(Lf\dh)  =  Span{%iU)(d/^):uE[-l,l]m,  j  =  l,...,p;  *  =  1,2,...}  (3.5) 
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denote  the  smallest  codistribution  containing  {dh1,  •  ■  ■ ,  dhp}  which  is  invariant 
under  the  vector  fields  {/(-,tt)  :  u  £  [— l,l]m}.  Here,  -h/(.,u)  denotes  the  Lie 
derivative  with  respect  to  the  vector  field  /(•,  u ),  and  dh 3  is  the  gradient  1-form 
for  the  real  valued  function  h3,  j  —  1, . . .  ,p,  and  the  span  is  taken  with  respect 
to  the  ring  C°°(Mn).  This  is  called  the  observability  codistribution  [24]. 

Definition  The  control  system  (3.1)  is  said  to  satisfy  the  observability  rank 
condition  at  xQ  £  ]Rn  if  there  exists  a  neighbourhood  U  of  x0  such  that 

rank (Lj  \  dh)(x)  =  n  (3.6) 


for  all  x  £  U . 

Theorem  (Hermann-Krener  [24])  If  the  control  system  (3.1)  satisfies  the  ob- 
servabilty  rank  condition  at  xo,  then  the  system  has  the  local  distinguishability 
property  at  Xo-  Conversely,  if  the  control  system  (3.1)  has  the  local  distinguisha¬ 
bility  property,  then  the  observability  rank  condition  is  satisfied  generically. 

The  idea  is  that  if  the  observability  rank  condition  fails,  then  the  leaves 
of  the  foliation  induced  by  the  annihilator  of  the  observability  codistribution 
form  equivalence  classes  of  indistinguishable  states.  Such  local  results  amount 
essentially  to  obtaining  information  by  “differentiating  the  output”. 

The  definition  of  observability  is  connected  with  the  problem  of  determining 
exactly  the  unknown  state.  As  we  shall  see  in  Chapter  6,  it  turns  out  that, 
rather  surprisingly,  a  notion  weaker  than  observability  is  sufficient  for  successful 
observer  construction.  We  are  not  necessarily  trying  to  recover  the  state  exactly. 
Rather,  it  suffices  to  estimate  the  state  trajectory  asymptotically  as  t  — >  oo.  One 
can  construct  observers  for  detectable  linear  systems  (defined  in  the  example 
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below)  which  are  not  observable.  This  was  first  noted  by  Wonham;  see  [59], 
Chapter  3.  The  situation  is  similar  for  nonlinear  systems;  see  Chapter  6. 

Define 


A{x,u)  =  Df(x,u ), 

H(x)  =  Dh(x), 

where  D  denotes  the  gradient  in  the  x  variables.  Assume  that 

||  A  ||  —  sup  ||  A(x,u )  ||  <  oo, 

xeRn,  M<i 

||  H  ||  =  sup  ||  Ii(x)  ||  <  oo. 

xeiRn 


Definition  3.2  The  control  system  (3.1)  is  uniformly  detectable  if  there  exists 
a  bounded  continuous  matrix  valued  map  A(ar,u)  and  a  constant  cto  >  0 

such  that 

rj' (A(x,u)  +  A(x,u)H(x))r]  <  -a0  |  y  |2  (3.7) 

for  all  r)  6  lRn,  x  €  Mn,  u  €  [—1,  l]m. 

A  disadvantage  of  this  condition  is  that  it  is  in  general  difficult  to  check,  and 
A (x,u)  may  be  hard  to  compute.  No  simple  rank-type  condition  exists  to  date. 
A  simple  but  less  general  condition  is  the  following. 

Definition  3.3  The  control  system  (3.1)  is  uniformly  of  full  rank  if  there  exists 
a  constant  so  >  0  such  that 

H(x)'H(x )  >s0I  (3.8) 

for  all  x  E  Ma . 
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EXAMPLE  ( Linear  Systems)  Consider 


x  =  Ax  +  J5u,  (3-9) 

V  =  Cx. 


The  system  (3.9)  (or  the  pair  (C,A))  is  observable  if 


p|  ker (CA'~l) 
i— 1 

Here, 


( Lf  |  dh)  =  span {<7,  CA,  •  •  • ,  CA^1} 


and  so  observability  is  equivalent  to  the  rank  condition 


rank  [<7,  CA,  •  •  • ,  CLT1"1]  =  n. 


Let  <S(A)  denote  the  stable  subspace  of  A,  that  is,  the  span  of  the  (generalised) 
eigenvectors  corresponding  to  eigenvalues  with  strictly  negative  real  part.  The 
pair  (C,A)  is  detectable  if 

f)k eriCA1-1)  C5(d). 

i=l 

This  is  equivalent  to  the  existence  of  a  matrix  A  such  that  the  eigenvalues  of  the 
matrix  A  +  AC  have  strictly  negative  real  parts  (Wonham  [59]).  Thus  uniform 
detectability  for  ( C ,  A)  implies  detectability,  but  not  conversely.  States  a;J,  x q 
are  indistinguishable  if 

xo~xl  €  H  MCA'"1). 

i— 1 

So  observability  implies  detectability,  but  not  conversely.  /// 

Thus  uniform  detectability  requires  roughly  speaking  that  states  which  are 
unobservable  decay  exponentially  to  zero. 
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Remark  Filtering  theory  does  not  generally  address  the  observability  issue. 
Mitter  [45]  mentions  observability  in  connection  with  his  discussion  of  the  ex¬ 
tended  Kalman  filter;  we  discuss  this  further  in  Chapter  7.  The  conditional  ex¬ 
pectation  gives  the  conditional  minimum  variance  estimate,  but  the  error  does 
not  in  general  converge  to  zero  as  t  — »  oo;  see  Kunita  [36].  /// 


3.3  Application  to  Linear  Systems 

In  this  section  we  provide  a  complete  description  of  the  asymptotic  filtering 
method  as  it  applies  to  detectable  linear  systems.  For  further  details,  refer  to 
Baras,  Bensoussan  and  James  [2]. 

The  method  constructs  explicitly  an  observer  for  the  linear  system 

x(t)  =  Ax(t)  +  Bu(t),  x(0)  =  x0,  (3.10) 

y{t)  =  Cx{t ), 

as  the  asymptotic  limit  of  (Kalman)  filters  for  a  family  of  associated  filtering 
problems 

dxc{t)  =  Axe(t)dt  +  Bu(t)dt  +  \/eNdw(t),  xe(0)  =  x^,  (3.11) 

dC{t)  =  Cxe(t)dt  +  ^Rdv(t),  f (0)  =  0. 

Such  a  construction  is  suggested  by  the  fact  that  for  certain  choices  of  Q q  = 
cov(xo),  the  filters  are  independent  of  e,  as  discussed  in  Baras  and  Krishnaprasad 
[!]• 

The  work  of  Hijab  [25],  [26]  is  indispensible  here  in  deriving  a  large  deviation 
principle  for  certain  conditional  measures  (see  Chapter  5),  and  identifying  the 
limit  of  the  filters  for  (3.11)  as  an  associated  deterministic  estimator  (Chapter 

4)- 
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We  assume  that  x(t)  E  JR",  u(t)  E  lRm ,  y(t)  E  Mp ,  and  t  i— >  u(t)  is  piecewise 
continuous. 

The  observer  problem  here  consists  of  constructing  a  dynamical  system 

m(t )  =  Em{t)  +  Fu(t)  +  Gy{t),  m( 0)  =  ra0,  (3.12) 

x(t)  =  Hm(t), 

so  that  the  error 

e[t)  —  x{t)  —  x(t)  (3.13) 

decays  exponentially  fast  to  zero,  at  a  rate  controlled  by  the  designer,  indepen¬ 
dent  from  the  choice  of  mo  and  x0.  Here  the  matrices  E,  F,  G  and  H  are  possibly 
time-varying  and  the  dimension  of  m(t )  is  not  necessarily  n. 

Solutions  to  this  problem  are  well  known,  first  given  by  Luenberger  [44],  In 
particular,  if  the  pair  (C,A)  is  detectable,  then  there  exists  a  matrix  T  such  that 
the  matrix  A  +  TC  has  eigenvalues  in  the  open  left  half  plane.  Then  set 

E  =  A  +  TC,  F  =  B ,  G=-T,  H  =  I. 

In  this  case  the  error  (3.13)  satisfies 

e(t)  —  (A  +  TC)e(t),  e(0)  =  x0  -  m0, 

and  the  eigenvalues  of  A  +  TC  can  be  arbitrarily  assigned  by  the  designer  if 
and  only  if  (C,A)  is  observable.  Typically  T  is  selected  by  transforming  ( C ,  A) 
into  the  observer  standard  form  and  choosing  the  coefficients  of  T  to  achieve  a 
desired  characteristic  polynomial  for  A  +  YC.  Alternatively,  one  may  employ  the 
grammian  approach  for  stabilisation  of  Kleinman  [37]. 

Consider  the  system  (3.10).  Define  £(f)  =  /0*  y(s)ds,  so  that  (3.10)  becomes 

x(t )  =  Ax(t)  +  Bu{t),  x(0)  =  xq,  (3.14) 

i(t)  =  Cx(t),  £(0)=0. 
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Then  associate  with  (3.14)  the  family  of  filtering  problems  (3.11),  where  w,  v 
are  independent  standard  ^-dimensional,  respectively  p-dimensional  Brownian 
motions.  The  initial  condition  Xq  is  Gaussian,  independent  from  w,  v  with 
E(x q)  =  m0e,  cov(xq)=(5o  ,  where  Qq  is  positive  definite.  The  matrix  R  is 
assumed  positive  definite. 

As  is  well  known,  the  minimum  variance  estimate  xe(t)  =  E(x(t)  |  £e(s),0  < 
s  <  t)  for  the  linear  Gaussian  filtering  problem  (3.11)  is  given  by  the  Kalman 
filter  [10] 

dxe(t)  =  Axe(t)dt  +  Bu(t)dt  +  Qe{t)C'{RR')-1(dC(t)-Cxc(t)dt), 
a-c(0)  =  m0£,  (3.15) 

where  Qc  satisfies  the  Riccati  equation 

Qe(t)  =  AQc(t)  +  Qe(t)A'  ~  Qt{t)C\RR!)~lCQt{t)  +  NN',  (3.16) 

Q\ 0)  =  Ql/e. 

Note  that  these  filters  depend  on  e  only  via  the  matrix  Qq/ e.  In  fact,  if  we  choose 
Qq  =  eQ o,  then  all  the  filters  are  independent  of  e  and  identical  with  the  filter 
for  e  =  1. 

Define 


m(t)  =  Am(t)  +  Bu(t)  +  Q(t)C'{RR!y\y{t)-Cm{t )),  (3.17) 

m(0)  =  mo, 


where  Q(t)  is  the  solution  of  the  Riccati  equation 


Q(t)  =  AQ{t)  +  Q{t)A  —  Q{t)C'  {RR!)-1  C  Q{t)  +  NN',  (3.18) 

Q(0)  =  Qo 
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As  is  discussed  in  Chapter  4,  m(t)  =  x(t)  is  the  deterministic  minimum  energy 
estimate  of  x(t)  given  the  observation  record  Yt  =  {?/(s);0  <  s  <  t}.  This  is 
closely  related  to  a  large  deviation  principle  for  the  conditional  measures  (Chap¬ 
ter  5),  and  (3.17),  (3.18)  is  the  limiting  filter.  In  fact,  if  lim^o^o  =  ra0  and 
lim£^o  Qo  =  Qo,  one  can  prove: 

limi?  |  x£(t )  —  m(t)  |2=  0 

uniformly  on  [0,  T].  In  fact,  m(-)  coincides  with  the  solution  of  (3.10)  for  the 
initial  condition  x(0)  =  m0. 

We  are  interested  in  the  asymptotic  behaviour  of  e(t)  =  x(t)—m(t)  as  t  —*  oo, 
and  we  make  the  natural  assumption  that  the  pair  (C,A)  is  detectable.  We  also 
assume  rank  N  =  n  and  R  >  0.  It  is  important  to  obtain  bounds  on  ||  Q(t)  || 
and  ||  P(t )  ||,  where  P(t)  =  Q(t)~1.  As  will  be  explained  in  Section  6.2,  there 
exist  constants  q  >  0,  p  >  0  such  that 

\\Q(t)\\<q,  ||P(t)||<p 

for  all  t  >  0.  As  in  [2]  and  Section  6.3,  one  can  use  the  Lyapunov  function 
x'P(t)x  to  prove 

|  x(t)  —  m(t)  |<  K  |  xq  —  m0  \  e_7<, 
for  all  t  >  0,  for  some  K  >  0,  7  >  0. 

Thus  the  deterministic  filter  (3.17),  (3.18)  is  an  observer  for  the  linear  control 
system  (3.10).  This  design  is  a  useful  alternative  to  the  standard  Luenberger 
design,  and  can  be  easily  be  extended  to  time  varying  systems  provided  that  we 
assume  uniform  complete  observability  (Kalman’s  terminology  [35]). 
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3.4  Other  Approaches 

In  this  section  we  give  a  brief  survey  of  some  observer  designs  proposed  by  other 
authors  that  have  come  to  our  attention.  This  discussion  will  serve  to  place  our 
approach  in  perspective. 

Since  the  linear  theory  is  well  understood,  it  is  perhaps  natural  to  use  linear 
theory  where  possible  in  designing  observers  for  nonlinear  systems.  One  obvious 
approach  is  to  linearise  the  system  (3.1)  about  an  equilibrium  xo,  u$\  f(xo,uo )  = 
0.  Expanding  x(t)  =  x0  +  8x(t)  +  . . . ,  u(t)  =  uQ  +  8u(t)  +  . . .,  we  obtain  the 
linear  system 


8x(t)  =  A08x(t)  +  B06u(t),  (3.19) 

8iy(t)  =  C08x(t). 

Then  if  (Co,  Aq)  is  detectable,  one  might  design  an  observer  for  (3.19)  and  use  it 
to  estimate  x(t),  the  state  of  (3.1).  This  of  course  would  be  valid  only  for  small 
excursions  from  the  equilibrium. 

Another  approach  is  to  try  to  find  coordinates  in  which  the  system  exhibits 
suitable  linear  features.  For  example,  Krener  and  Respondek  [39]  consider  the 
system 


Z  =  /(£,«),  £(o)»&,  (3.20) 

4  =  HO, 

in  some  neighbourhood  of  a  given  point  £0.  Here,  £  £  Mn ,  <f)  £  1RP.  Under  certain 
conditions,  they  find  coordinate  transformations  E  :  IRn  — >  ]Rn ,  $  :  1RP  — *  1RP, 

Z  =  4  =  $(y)> 
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so  that  (3.20)  becomes 


x  =  Ax  +  i(y,u),  (3.21) 

y  =  Cx, 

where  ((7,  A)  is  an  observable  pair  in  observer  standard  form.  Then  T  can  be 
chosen  so  that  A  +  YC  has  eigenvalues  with  negative  real  parts  and  an  observer 
for  (3.21)  is 

i  =  {A  +  TC)z  -Ty  +  7(2/,  u), 
with  error  equation  (e  —  x  —  z) 

e  =  (A  +  TC)e. 

To  obtain  an  observer  for  (3.20),  one  writes  the  ODE  for  (  =  E(z): 

C  = 

Then  at  least  on  a  compact  region  containing  £0)  the  error  £  —  (  decays  expo¬ 
nentially  to  zero. 

Most  of  their  paper  [24]  is  devoted  to  conditions  under  which  the  above 
procedure  is  possible.  These  conditions  are  related  to  local  observability,  and  the 
computation  of  the  coordinate  transformations  requires  the  solution  of  certain 
PDEs.  This  approach  has  been  extended  by  Levine  and  Marino  [42],  where 
the  nonlinear  system  is  immersed  in  a  linear  system  of  higher  dimension;  the 
transformations  are  local  immersions  rather  than  local  diffeomorphisms. 

Kuo,  Elliott  and  Tarn  [40]  consider  the  uncontrolled  system 

i  =  /(*),  (3-22) 

y  =  h(x). 
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They  propose  the  following  design: 


z  =  f(z )  +  B(y  -  Kz ))• 

If  there  exists  a  constant  matrix  B  and  a  positive  definite  constant  matrix  Q 
such  that 

Q  (Df(x)  —  BDh(x)) 

is  uniformly  negative  definite,  then  the  error  decays  exponentially  to  zero  for  all 
initial  values.  Their  proof  uses  the  Lyapunov  function  e'Qe.  Notice  in  particular 
that  B  and  Q  must  be  constant,  and  no  constructive  procedure  for  selecting 
these  matrices  is  given. 

Observer  design  for  bilinear  systems 

x  =  +  Y^UiB^j  x  =  A(u) x,  (3.23) 

y  =  Cx, 

is  nontrivial.  A  survey  of  some  design  methods  is  presented  in  Derese,  Stevens 
and  Noldus  [12].  A  candidate  observer  is 

z  =  {A(u)  +  T{u)C)z-T(u)y,  (3.24) 


with  error  equation 

e  =  (A(u)  +  T(u)C)e.  (3.25) 

Then  in  order  that  (3.24)  be  an  observer  for  (3.23),  one  needs  to  find  a  matrix 
valued  function  T(-)  such  that  (3.25)  is  asymptotically  stable  for  all  controls 
t  i->  u(t).  The  results  discussed  in  [12]  address  this  issue,  and  sufficient  con¬ 
ditions  are  obtained  which  are  applicable  in  various  cases.  In  general,  no  con¬ 
structive  method  for  computing  T(-)  is  available,  a  difficulty  shared  with  uniform 
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detectability.  Williamson  [58]  views  (3.23)  as  a  time  varying  linear  system  and 
seeks  to  transform  this  system  into  a  canonical  form. 

We  finally  remark  that  observer  theory  for  linear  systems  remains  an  active 
area  of  research:  0’  Reilly  [49].  For  linear  time  varying  systems,  it  seems  impos¬ 
sible  to  avoid  using  the  observability  grammian;  no  simple  rank-type  condition 
is  known.  See  also  Section  6.4.1. 
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Chapter  4 

Deterministic  Estimation 


In  1968,  Mortensen  [48]  proposed  a  deterministic  method  for  state  estimation 
(filtering),  based  on  the  idea  that  the  most  likely  state  trajectory  has  minimum 
energy  (in  a  certain  sense).  The  resulting  filter  is  infinite  dimensional,  being 
characterised  by  a  Hamilton-Jacobi  equation,  and  generically  the  estimate  is 
not  computable  by  a  system  of  finite  dimensional  equations.  This  difficulty  is 
shared  with  the  stochastic  method  of  filtering  recalled  in  Chapter  2.  In  his  thesis 
[25],  Hijab  studied  Mortensen’s  method  and  compared  it  with  the  stochastic 
approach.  He  was  interested  in  the  finite  dimensional  computability  issue,  as 
well  as  small  parameter  asymptotics. 

We  employ  the  relatively  recent  notion  of  viscosity  solution  for  Hamilton- 
Jacobi  equations  and  give  a  new  treatment  of  Mortensen’s  method.  This  is 
fundamental  to  the  use  of  PDE  techniques  for  proving  the  small  parameter 
asymptotics  in  Chapter  5.  In  addition,  we  obtain  a  continuity  result  when  the 
deterministic  estimator  is  well  defined,  and  introduce  an  approximate  determin¬ 
istic  estimator  which  will  be  the  basis  of  our  observer  design  in  Chapter  6. 

4.1  Formulation 

We  begin  by  reviewing  Mortensen’s  method  [48],  [25]  of  deterministic  minimum 
energy  estimation. 

Given  an  observation  record  Yt  =  {y(.s),  0  <  s  <  i},  0  <  t  <  T,  of  the 
deterministic  system 

x  =  f(x )  +  u,  x(0)  =  x0,  (4.1) 
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y  =  h(x)  +  v,  y( 0)  =  0, 


we  wish  to  estimate  the  state  at  time  t ,  the  initial  condition  xq  being  unknown. 
Here,  u  E  T2([0, T], Hln),  v  E  T2([0,  T],  iR)  are  deterministic  models  for  noise. 
We  assume  that  f  E  Cl(lRn,lRn)  and  h  E  C^(lRn ,  1R).  Then 

Yt  £  Hq,  —  {v  £  Ho,  :  is  absolutely  continuous  and  j|  rj  \\h1<  °°} 

where 


The  most  likely  path  is  one  for  which  u,v  have  least  energy  and  produce  the 
given  observation  record  Yt. 

More  precisely,  define 

Jt(x0,u,v)  =  ^(xo )  +  \jQ  (\u(s)  \2 +v(s)2)  ds,  (4.2) 

where  S0  is  Lipshitz  continuous,  convex,  and  5o(m0)  =  0,  Sq(x)  >  0  if  x  / 
m0  E  lRn.  A  minimum  energy  input  triple  (xq,u*,v*)  given  Yt  is  a  triple  that 
minimises  Jt  subject  to  the  constraint  that  the  trajectory  of  (4.1)  produces  the 
output  Yt.  By  replacing  v(s)  by  y(s)  —  h(x(s))  in  (4.2)  and  omitting  the  y(s )2 
term,  we  can  formulate  an  equivalent  unconstrained  optimal  control  problem. 
Define 

Jt(x0,u)  =  S0{x 0)  +  f  L(x(s),u(s),s)ds,  (4.3) 

Jo 

where 

L(x,u,s)  =  i  |  u  j2  +  ^h(x)2  -  y(s)h(x).  (4.4) 

We  now  minimise  Jt  over  pairs  (x0,  u).  The  deterministic  or  minimum  energy 
estimate  x(t )  given  Yt  is  defined  to  be  the  endpoint  of  the  optimal  trajectory 
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5  t— >  x*(s),  0  <  s  <t,  corresponding  to  a  minimum  energy  pair  (sq,  u *)  :  x(t )  = 
x*(t). 

Next,  we  use  dynamic  programming  to  study  this  problem.  Given  a  control 
t  i — >  u(t),  let  xu  denote  the  corresponding  trajectory  (given  a  specified  initial 
condition).  Following  the  general  scheme  presented  in  Fleming  and  Rishel  [19], 
define  a  class  of  admissible  pairs  ( xq ,  u)  by 

UXit  =  {(x0,u)  £  IRn  x  L2([0,T],IRn)  :  xu(0)  =  x0,  xu(t)  =  x}\  (4.5) 

that  is,  pairs  for  which  the  corresponding  trajectory  passes  through  a  specified 
point  x  at  time  t.  Define  a  value  function 

W(x,t)  =  inf  Jt(x0,u).  (4.6) 

(a-o.ti)  e  ux<t 

Note  that  this  is  a  reversal  of  the  standard  set-up  of  dynamic  programming 
[19].  By  using  standard  methods,  we  see  that  W(x,t)  is  continuous  and  formally 
satisfies  the  Hamilton- Jacobi-Bellman  (HJB)  equation 

ftW(x,t)  +  H(x,t,DW(x,t))  =  0,  (4.7) 

W{x,  0)  =  So(x), 

where 

H(x,t ,  A)  =  max  {A  •(/(*)  +  u)  —  L(x,  u,  t)}  .  (4.8) 

u  6  U 

W(x ,  t)  is  the  minimum  value  of  Jt  subject  to  the  end  point  condition  xu(t )  =  x. 
The  deterministic  estimate  is  given  by: 

x{t)  =  W(x,t),  (4.9) 

and  we  say  that  the  deterministic  estimator  is  well  defined  if  x(t )  is  the  unique 
minimiser  of  W(-,t )  for  each  t  and  each  observation  record.  When  we  wish  to 
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make  explicit  the  dependence  on  the  observation  path,  we  write  W(x,t)[Y],  etc. 
As  described  by  Hijab  [25],  this  gives  a  deterministic  estimator  (filter)  ft: 

%  ■  ft#  -  V(Mn)  (4.10) 

Yt  - 

In  control  theoretic  terms,  we  can  regard  the  IIJB  equation  (4.7)  as  part  of  an 
infinite  dimensional  realisation  of  the  map  ft  (c.f.  Hijab  [25]). 

Proposition  4.1  For  any  Yt  £  ^o,t  >  there  exists  a  minimum  energy  pair 
( Xq,u *).  If  the  map  Jt  :  fln  — >  1R  defined  by 

jt(0)  =  So(0o)  +  J^-\  0S-  f(0s )  |2  +\h(0s)2  -  ysh(0s)ds  (4.11) 

is  strictly  convex,  then  the  minimum  energy  pair  (xq,  u*)  is  unique.  If  this  holds 
for  each  0  <  t  <  T,  then  the  deterministic  estimator  (4.10)  is  well  defined. 

Proof:  Notice  that 

Jt(0)  =  Jt(0o,d-f{9)). 

One  can  check  that  Jt  is  lower  semicontinuous  (uniform  convergence)  and  for 
each  M  >  0,  [0  £  f V1  :  JfiO)  <  M}  is  compact  (uniform  topology).  Hence  there 
exists  0 *  £  IV1  such  that 

Jt(0*)  =  min  Jt(0). 
v  ’  eenn  ' 

Then  ( 6*o,0 *  —  f(0*))  is  a  minimum  energy  pair.  The  remaining  assertions  are 
easily  verified.  I 

Example  ( Linear  Systems;  Krener  [38]  )  Consider  the  system 

x  =  Ax  +  Nu,  x(0)  =  xq,  (4-12) 

if  —  Cx  +  Rv, 
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with  energy  or  cost  function 


Jt{x o,  u)  =  ~(xQ  -  moyPo(x0  —  m0) 

+  |  u(s)  |2  |  R-'Cxis)  |2 

Jo  2  2 

where  P0  >  0,  R  >  0,  and  rank  TV  =  n.  The  value  function  has  the  explicit  form 

W{x,t)  =  ^(x-x(t))’P(t)(x-x(t))+£  X-  |  BTxCx{s)  |2  -y(s)(i2i2')-1C'a(3)<i3, 

where  x(t)  is  the  deterministic  estimate,  given  by 

£(t)  =  Ax^  +  Pity'CXRR'y'O/iO-Cxit)),  x(0 )  =  m0,  (4.13) 

and  P(t)  is  the  positive  definite  solution  of  the  Riccati  equation 

P(t)  =  —P(t)A  —  A'P(t)  —  P(t)NN'P(t)  +  C\RR!)~lC,  (4.14) 

P(  0)  =  Po. 

Thus  the  deterministic  estimator  for  the  linear  system  (4.12)  is  well  defined 
and  has  a  finite  dimensional  realisation  (4.13),  (4.14).  /// 

4.2  Viscosity  Solution 

We  now  turn  to  the  IIJB  equation  (4.7).  If  Y  £  then  the  Hamiltonian  H 
(4.8)  is  in  general  only  measurable  in  t.  In  this  section  we  restrict  ourselves  to 

y,  e  n£  =  {,  e  c’(M, b)  ■.  ,(o)  =  o} . 

This  space  is  equipped  with  the  norm 

||  V  lie1  =  sup  |  rj(s)  |  . 

0  <s<t 

Then  H  £  C{Mn  x  [0,T]  x  1 Rn-,  1R). 
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Now  we  prove  that  W(x,t)  is  the  unique  viscosity  solution  of  the 
Iiamilton-Jacobi-Bellman  equation  (4.7).  Our  assumptions  imply  that  UXtt  /  0 
for  all  x  E  Mn,  0  <  t  <  T,  and  consequently  W(x,t)  <  oo.  The  analysis  does 
not  assume  existence  of  optimal  controls. 

The  following  definition  is  taken  from  Crandall,  Evans  and  Lions  [9].  Write 
C  =  C(lRn  x  (0,T),  JR),  and  similarly  for  C 1  =  Cx{lRn  x  (0,T),JR). 

Definition  Let  W  E  C.  We  say  that  W  is  a  viscosity  subsolution  of  (4.7) 
provided  that  for  all  <f>  E  C1  the  following  property  holds: 

if  W  —  <j>  attains  a  local  maximum  at  a  point  ( x,t )  6  JR"  x  (0,T),  then 

§-tf(x,t)  +  H(x,t,D(f>(x,t ))  <  0.  (4-15) 

We  say  that  W  is  a  viscosity  supersolution  o/(4.7)  provided  that  for  all  (j>  £  C1 
the  following  property  holds: 

ifW  —  f  attains  a  local  minimum  at  a  point  ( x,t )  E  Mn  x  (0,T),  then 

§i<f>{x,t)  +  H(x,t,Df(x,t))  >  0.  (4.16) 

If  W  is  both  a  viscosity  subsolution  and  supersolution,  we  say  that  W  is  a  vis¬ 
cosity  solution  of  ( 4.7). 

Lemma  4.1  (Principle  of  Optimality)  Let  0  <  t  —  h  <  t.  Then 
W(x,t)  =  inf  \w(xu(t-h),t-h)  +  f  L(xu(s),u(s),s)ds\ .  (4.17) 

(x0,u)£UXlt  l  Jt—h  J 

Proof:  Let  ( x0,u )  E  Ux,t  and  choose  (xq,u)  E  UXu{t-h),t-h •  Define 


u(s) 


u(s)  0  <  s  <  t  —  h 

< 

u(s)  t  —  h  <  s  <t. 
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Then  u  E  Ux<t,  and  hence 


rt-h 

W(x,t)  <  S0(x0)  +  /  L(xu(s),u(s),s)ds 

Jo 

+  /  L(xu(s),u(s),s)ds. 

Jt-h 

Taking  the  infimum  of  the  right  hand  side  over  (xo ,u)  E  Uxu(t-h),t-h  we  obtain 
W(x,t)  <  W(xu(t  —  h),t  —  h)  +  J  L(xu(s),u(s),s)ds. 

Now  let  6  >  0.  Choose  (5o,  u)  E  ldx,t  such  that 

W(x,t)  +  6  >  So(xu(0))  +  J  L(xa(s),u(s),s)ds 

>  W(xu(t  —  h),t  —  h)  +  ^  L(xu(s),u(s),  s)ds 

>  inf  {w(xu(t  —  h),t  —  h)+  [  L(xu(s),u(s),s)ds\ . 

( xo,u)€.UXlt  l  Jt—h  J 

These  inequalities  imply  (4.17).  | 


Fix  (x,t)  and  choose  7  >  W{x,t).  Define 


l£,t  =  {(»o,«)  €Ux,t  ■  Mxo,u)<l}, 
Bt  =  {x  E  IRn  :\x-  x1  \<  e}. 


Lemma  4.2  Fix  e  >  0.  Then  there  exists  77  >  0  such  that  if  ( Xq,u )  E  lQyi  then 
xu(t  —  h)  E  Bt  for  all  0  <  h  <  rj. 

Proof:  Note  that  xu{t )  =  x  E  Bc.  Define 

rju  —  sup{/i  >  0  :  jc^s)  G  Bt  for  all  s  G  [t  —  h,t]}- 

Then  |  xu(t  —  rju)  —  x  \=  e.  Let 

r)  =  inf  t}u. 

(a;0,«)  e 
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We  want  to  show  that  r/  >  0.  Suppose  not;  q  =  0.  Then  there  is  a  sequence 


(a’o,  un)  £  U2,t  with  qUn  — »  0  as  n  — >  oo.  Write  xn  —  xUn ,  etc. 

Now  /  is  continuous,  so  there  is  a  constant  K  >  0  such  that  |  f(x')  |<  K  for 
all  x1  £  Bt.  Then 


0  <  £ 


x  -  xn(t  -  Tjn) 

r* 

It— Tin 


<  /  (\  f(xn(s))\  +  \un(s)\)ds 

Jt-Tln 

<  Kr]n  +  f  |  un(s)  |  ds 

dt—rjn 

Choose  No  >  0  such  that  n  >  No  implies  Krjn  <  e/2.  Then 
0  <  e/2  <  f  |  un(s)  |  ds  for  n  >  N0. 

Jt-Vn 

(Note  that  if  U  is  bounded,  then  the  lemma  follows  from  this  inequality.) 

Next,  since  (a:o,un)  £  U2tt  it  follows  that 

f  |  u(s)  |2  ds  <  7. 

Jt-Vn 

Then 

0  <  e/2  <  f  |  un(s )  |  ds 

J  Vn 

<  y/yq2  for  n  >  N0, 

using  the  Cauchy-Schwarz  inequality,  which  is  impossible  since  — *  0.  Con¬ 
sequently  r]  >  0  and  the  lemma  is  proved.  i 


Theorem  4.1  The  value  function  W(x,t)  defined  by  (4.6)  is  the  unique  viscosity 
solution  of  the  Hamilton- Jacobi-Bellman  equation  (4.7). 

Proof:  First  we  show  that  W(x,t)  is  a  viscosity  subsolution.  Let  f  £  C1  and 
suppose  that  W  —  4>  attains  a  local  maximum  at  (x,f).  Then  there  exists  e  >  0 
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such  that 


W(x,t)-<f>(x,t)  >  W(x',t')  -  <j>(x',t')  (4.18) 

for  all  x1  £  Be,  \  t  —  t'  |<  e. 

Choose  a  constant  control  u(s)  =  u  £  U.  There  is  an  x0  such  that  ( xo,u )  £ 
Ux,t •  Choose  0  <  8  <  e  such  that  xa(s)  £  Bt  for  |  t  —  s  |<  <5.  The  Principle  of 
Optimality  (4.17)  implies 

W(x,t )  <  W(xu(t  —  h),t  —  h)  +  L(xu(s),u(s),s)ds.  (4.19) 

If  0  <  h  <  6,  then  (4.18)  gives 


W(x,t)  —  <f>(x,t)  >  W(xu(t  —  h),t  —  h)  —  <f>(xu(t  —  h),t  —  h).  (4.20) 


Combining  (4.19)  and  (4.20)  we  obtain 

</>{xz(t  ~  h),t  —  h)  —  cf)(x,  t)  1  C 


-h 


I  r 

-  L(xu(s),u(s),s)ds  <  0. 

II  Jt—h 


Letting  h  —>  0  we  have 


■7^<f>(x,t)  +  D<f)(x:t)  ■  ( f(x )  +  u)  —  L(x,u,t)  <  0. 


But  this  holds  for  all  u  G  U,  hence  (4.15)  and  so  W{x,t)  is  a  subsolution  of  (4.7). 

To  see  that  W(x,t)  is  a  viscosity  supersolution,  let  <j>  G  C1  and  suppose  that 
W  —  <j>  attains  a  local  minimum  at  ( x,t ).  Then  there  exists  an  e  >  0  such  that 

W(x,t)  —  4>(x,t)  <  W(x',  t')  —  <f>(x',  t', )  (4-21) 


for  all  x'  G  Be,  \  t'  —  t  |<  e. 

Suppose,  contrary  to  (4.16),  that  there  exists  a  9  >  0  such  that 
§-t<f>( x,  t)  +  H(x ,  t ,  D(j>(x,  t))  <  -0  <  0. 
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By  continuity,  reducing  t  >  0  if  necessary, 


t')  +  max  {D<f)(x',  t ')  (f(x')  +  u)  —  L(x\  u,  t')}  <  —0  <  0  (4.22) 

u  E  U 

for  all  x1  £  Be ,  1 1  —  t'  |<  e.  Let  7  >  bP(a:,i)  and  let  r)  be  given  as  in  Lemma  3.2. 
By  the  Principle  of  Optimality  (4.17)  we  have 

W(x,t)  =  inf  {W(xu(t  —  h),t  —  h)  +  /  L(xu(s),  u(s),  s)d.s}.  (4.23) 
(x0,u)  e  u2, t  Jt-h 

Let  0  <  h  <  r]  A  e,  and  choose  (£0,  u )  £  U2)t  such  that 

/■t  Qfi 

W(xu(t  -  h),t  -  h)  +  Jt  ^L(xu(s),u(s),s)ds  <  W(x,t)  +  — .  (4.24) 

Since  xu(t  —  h)  £  Bc,  we  have  from  (4.21) 

W(xu(t  —  h),t  —  h)  —  <f>(xu(t  —  h),t  —  h)  >  W(x,t)  —  <j>(x,t).  (4.25) 


Combining  (4.24)  and  (4.25)  we  have 


0  <f)(xu(t  -  h),t  -  h)  -  <f>(x,t) 
2  “  —h 


1  B 

7  /  Zf(®„(s),u(s),s)ds. 

/i  Jt—h 


(4.26) 


However,  for  t  —  h  <  s  <  t,  xu(s)  £  Be  and  |  t  —  s  |<  e,  and  so  from  (4.22)  we 
have 


ft(j)(xu(s),s)  +  D(j)(xu(s),s)(f(xu(s))  +  u(s))  -  L(xu(s),u(s),s)  <  -6. 


Integrating,  we  obtain 


4>(x,t)  —  <f>(xu(t  —  h),t  —  h)  1  B 


/  L(xu(s),u(s),s)d, 

Jt—h 


s  <  —0. 


(4.27) 


But  (4.26)  and  (4.27)  contradict  each  other,  so  we  must  have  6  <  0;  proving 
(4.16).  Thus  W(x,t)  is  a  supersolution  of  (4.7). 

The  uniqueness  assertion  follows  from  Ishii  [27],  Theorem  1.  In  fact,  since 
So(x )  is  uniformly  continuous,  it  follows  from  [27]  that  W(x,t)  is  also  uniformly 
continuous.  1 
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4.3  Continuity 

In  this  section  we  prove  that  the  deterministic  estimator,  when  well  defined, 
depends  continuously  on  the  observation  path. 


Theorem  4.2  There  exists  a  positive  constant  C  such  that 

sup  |  W(x,t)[p]  -  W(a,/)[C]  |  <  C\\rj~(\\Hi 

(x,t)eRnx[0,T] 

for  all  rj,  £  €  Qff1 . 


Proof:  Let  ( x,t )  G  lRn  X  [0,T],  a  >  0,  and  rj,  (  G  CIq1  .  Choose  G  Ux,t 

such  that 


Jt(x0,u)[r]]  <  W(x,t)[r}]  +  a. 


Now 


Jt(x0,u)[r)}- Jt(x0,u)[(}\  =  |  jf  (£  ~  r]s)h(xu(s))ds 

<  Wv-C  IMI  h  || . 


Then 


W(x,  t)[(]  <  Jt(xo,  u)[£] 

<  Jt(x0,u)[r)]  +  C  \\r)  -  (  \\w 

<  W(x,t)[t]]  + C  \\  rj  -  (  \\Hi  +a. 

Since  a  >  0  was  arbitrary, 

W(x,t)[(]  <  W(x,  <)[,)  +  c\\v-  Cliff.  • 

Similarly, 

W(x,t)[i]\  <  w(x,t)[(\  +  C  II  T]  -  c  \\m  • 
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This  completes  the  proof. 


I 


Corollary  4.1  Assume  that  the  deterministic  estimator  is  well  defined.  Then 
the  map  7ff  defined  by  (4.10)  is  continuous. 


Proof:  Fix  0  <  t  <  T.  It  is  enough  to  check  that 

)[??;]  — >  x{t)[rj ]  in  IRn  as  rji  —*  r)  in  TIq  . 
Suppose  not.  Then  there  exists  6  >  0  such  that 

I  x(t)[T]i]  -  x(t)[r]]  |  >  8 

for  all  i  sufficiently  large.  Since  W(-,t)[rj\  has  a  unique  minimum, 

|  x  —  x(t)[r /]  |  >  6 

implies 

W(x,t)[r]]-W(x(t)[r}],t){r)]  >7, 
for  some  7  >  0.  Also,  we  have 


W(x(t)[r]i],t)[rii]  <  W(x,t)[r]i) 


for  all  x  G  JRn .  From  Theorem  4.2, 

I  W(x,t)[i n}-W(x,t)[rj]  |  < 

for  all  i  sufficiently  large. 

Now 


(4.28) 


W(x(t)[i]i\,t)[rj]  <  W(x(t)[rji},t)[r]i}+  \  W(x(t)[rn],t)[r]]  -  W(x(t) [17,-], t)[r}i] 

<  W(x(t)[f]],t)[rii\  +  | 

<  W{x(t)[r]],t)[ri]  + 

contradicting  (4.28). 
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4.4  Approximate  Deterministic  Estimation 


In  this  section  we  wish  to  motivate  the  observer  design  appearing  in  Chapter 
6.  In  Chapter  5  we  will  show  that  the  deterministic  filter  is  the  limiting  fil¬ 
ter.  Thus  in  terms  of  the  asymptotic  filtering  approach  to  observer  design,  the 
deterministic  estimator  is  a  candidate  observer.  Unfortunately  from  a  practi¬ 
cal  point  of  view,  the  deterministic  estimator  is  generically  infinite  dimensional. 
Also,  since  W(x,t)  is  in  general  only  Lipshitz  continuous,  the  deterministic  es¬ 
timate  is  difficult  to  compute.  Thus  an  approximate  finite  dimensional  method 
of  deterministic  estimation  is  desirable. 

Suppose  that  W(x,t)  is  a  smooth  solution  of  the  HJB  equation  (4.7),  and 
the  deterministic  estimator  is  well  defined.  Then  (4.9)  implies 

DW(x(t),t)  =  0. 

Differentiating,  we  get 

D2W(x(t),t)i{t)  +  D§-tW(x{t),t)  =  0. 

Using  (4.7)  this  gives 

D2W{x(t),t)'x{t)  =  D2W{x{t),t)f{x{t))  +  Dh{x{t)){m~K^))) 

+DW{x(t),t)(Df{x(t))  +  D2W(x(t),t)). 

Therefore 

x(t)  =  f(x(t))  +  -  h(x(t)), 

£(0)  =  m0, 

where 

P(t)  =  D2W(x(t),t). 
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A  similar  calculation  yields 

P(t)  =  -P(t)A(x(t))-A(x(t))'P(t)-P(t)2  +  H(x(t))'H{x(t)) 

+D3W(x(t),t)(x(t)  -  f(x(t )))  -  D2h(x(t))(y(t)  -  h(x(t)), 

where 

A(x)  =  Df(x),  H(x)  =  Dh(x). 

For  linear  systems,  this  last  equation  reduces  to  the  Riccati  equation  (4.14). 

Continuing  in  this  way  one  finds  in  general  that  x(t)  cannot  be  computed  by 
a  finite  set  of  ODEs.  Moreover,  in  general  D2W  does  not  exist  ( D2W  exists  a.e. 
if  W  is  convex  in  a:). 

If  we  omit  the  high  order  terms,  we  obtain  the  system 

m(t)  =  f(m(t))  +  P(*)-1i/(m(t))'(y(t)  -  h(m(t)))  (4.29) 

m(0)  =  ra0, 

P(t)  =  —P(t)A(m(t))  -  A(m(t))'P(t)  -  P{t )2  +  H(m(t))'H(m(t)), 

P(  0)  =  P0  =  D2So(mo)>0.  (4.30) 

The  system  (4.29),  (4.30)  is  called  the  approximate  deterministic  estimator.  This 
is  the  deterministic  analogue  of  the  extended  Kalman  filter  (c.f.  Mitter  [45]), 
and  is  not  what  one  would  obtain  by  linearising  the  system  at  an  equilibrium 
position. 

We  can  interpret  this  in  terms  of  a  HJB  equation  and  hence  an  optimal 
control  problem.  Define 

Wa(x,t)  =  ^(x  -  m(t))'P(t)(x  -  m(t ))  +  J  ~h(m(s))2  -  y(s)h(m(s))ds. 

Here,  a  denotes  a  measure  of  the  nonlinearity: 

a  =  max  (||  D2 f  ||,  ||  D2h  ||)  . 
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One  expects  that  Wa  — »  W  as  a  — »  0. 

Then 

m(<)  =  “fgl  W"(M) 

(this  is  well  defined)  and  W"  solves  the  HJB  equation 

§iW°‘(x,t)  + Ha(x,t,DWa)  =  0  (4.31) 

Tra(z,0)  =  S0(x) 

where 

Ha(x,t,  A)  =  H(x,t,X)  +  Va(x,t) 

Va(x,t )  =  ( x  —  mt)'Pt  [  f  rD2  f(mt  +  rs(x  ~  mt))(x  —  mt)2drds 

Jo  Jo 

-  fo  fo  r  [I  H(mt  +  rs(x  -  m<))  I2  + 

H(mt  +  rs(x  —  mt))' D2h(mt  +  rs(x  —  mt))]  (x  —  mt)2drds  + 

\  |  H(mt)(x  -  mt)  |2  +  fo  fo  rD2h(mt  +  rs(x  -  mt))(x  -  mt)2drdsyt. 

The  optimal  control  problem  corresponding  to  (4.31)  has  Lagrangian  La(x,u,t ) 
=  1/2  |  u  |2  +1/2 h(x)2  —  y(t)h(x )  —  Va(x,  t).  This  may  be  useful  in  comparing 
x(t)  and  m(t)  when  a  is  small.  We  do  not  pursue  this  further  here. 
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Chapter  5 

Small  Parameter  Asymptotics 


We  study  the  asymptotics  as  c  — >  0  of  the  filtering  problem  introduced  in 
Chapter  2.  This  entails  studying  the  asymptotics  of  the  corresponding  Zakai 
ecpiation,  in  both  its  strong  and  weak  form.  We  obtain  estimates  showing 
qc(x,t )  x  exp  in  probability  as  e  — »  0  for  the  unnormalised  condi¬ 

tional  density  using  PDE  methods,  where  W(x,t)  is  the  value  function  arising 
in  Mortensen’s  deterministic  estimation  for  the  limiting  observation  path.  Also, 
we  prove  cr\(A)  x  exp  j  inf^g^  W{x ,  t)')  in  probability  as  e  — >  0  for  the  unnor¬ 
malised  conditional  measures,  directly  using  probabilistic  methods. 

Hijab  [25]  has  studied  this  asymptotic  estimation  problem,  and  he  obtained 
a  WKB  expansion  when  W(x,t)  is  smooth.  He  identified  the  limiting  filter  as 
Mortensen’s  deterministic  estimator.  In  addition,  Hijab  [26]  has  proved  a  LDP 
for  pathwise  conditional  measures,  which  we  recall  below. 

We  obtain  a  LDP  in  probability  for  the  conditional  measures,  and  prove 
that  they  converge  weakly  in  probability  to  the  Dirac  measure  concentrated  on 
the  deterministic  trajectory.  In  the  case  that  the  deterministic  estimate  is  well 
defined,  we  prove  that  the  pathwise  stochastic  filters  converge  weakly  to  the 
deterministic  estimator. 


5.1  PDE  Method 

We  consider  the  family  of  diffusion  processes  in  IRn  (2.2)  with  real  valued  obser¬ 
vations  (2.3).  Assume  as  before  that  /  G  Cl(IRn ,  lRn),  h  G  C£(IRn ,  1R),  and  x^ 
has  density  pjj  given  by  (2.1). 
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As  e  — »  0  the  trajectories  of  (2.2)  converge  in  probability  to  the  trajectory 
X  =  {x(t)  :  0  <  t  <  T}  of  the  deterministic  system  (2.35)  with  initial  condition 
x0,  and  the  trajectories  of  (2.3)  converge  in  probability  to  Z  —  { z(t )  :  0  <  t  < 
T}  £  rio»  where 

z{t)  =  /  h(x(s))ds.  (5.1) 

Jo 

Let  qe  be  the  solution  of  the  Zakai  equation  (2.27).  Our  objective  in  this 
section  is  to  obtain  the  asymptotic  formula 

qe(x,t)  =  exp^-^(kL(a;,i)  +  o(l))^  ,  (5.2) 

in  probability  as  e  — >  0,  where  W(x,t)  is  the  value  function  arising  in  determin¬ 
istic  estimation  for  the  observation  path  Z  £  Q0  (see  (5.29)  below). 

5.1.1  A  Hamilton- Jacobi  Equation 

Fix  Y  £  fi0  and  denote  by  pe(x,t)  =  pe(x,t)[Y]  the  solution  of  the  robust  Zakai 
equation  (2.31).  We  will  recover  the  probabilistic  interpretation  in  Section  5.1.3. 

Following  Fleming  and  Mitter  [18],  who  considered  filtering  problems  with 
e  =  1,  we  apply  the  logarithmic  transformation 

Sc(x,t)  =  — elog  pc(x,t)-  (5.3) 

Then  Sc(x,t)  satisfies 

§-tS*(x,  t )  -  ^  A S*(x,  t)  +  FP(x,  t,  DS\x ,  t))  =  0,  (5.4) 

S^XjO)  =  S0(x)  -  elog(7£, 

where 

H‘(x,t,  A)  -  X-g(x,t)  +  ±\\\2-V'(x,t),  (5.5) 
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Ve(x,t)  =  Ve(x,ij(t))  + edWg(x,y(t)), 
<?0M)  =  g(x,y(t)). 


Here  Vt(x,y)1  g(x,y)  are  defined  by  (2.13),  (2.16).  Equation  (5.4)  is  a  nonlinear 
parabolic  PDE,  which  can  be  interpreted  as  the  Bellman  equation  for  a  stochastic 
control  problem  [18]. 

Formally  letting  e-tOwe  obtain  a  Hamilton-Jacobi  equation 

§-tS(x,t)  +  H(x,t,DS(x,t))  =  0,  (5.6) 

S(z,0)  =  S0(x), 

where 

A)  =  A  •  g(x,t)  +  ~  |  A  |2 -V(x,t),  (5.7) 

V(x,t)  =  i/i(x)2  +  y(t)Dh{x)f{x)  -  ij/(f)2  |  Dh(x)  |2  .  (5.8) 

Note  that  Vc  — >  V  and  Hc  — »  H  uniformly  on  compact  subsets. 

We  shall  interpret  solutions  of  (5.6)  in  the  viscosity  sense.  In  fact,  S(x,t)  can 
be  interpreted  as  the  value  function  for  an  optimal  control  problem.  Consider 
the  dynamics 

x  =  g(x,s)+u ,  x(0)  =  x0-  (5.9) 

We  wish  to  minimise 

It(xo,u)  =  So(xo)  +  fQ  Q  I  u(s)  I2  +E(xu(s),s))  ds.  (5.10) 
Denote  by  Tx,t  the  corresponding  class  of  admissible  pairs  (x0,u).  Define 

S(x,t )  =  inf  It(x0,u).  (5.11) 

(x0,u)  e  tx, t 

The  arguments  in  Chapter  4  can  be  used  to  prove  the  following. 
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Theorem  5.1  The  value  function  S(x,t )  defined  by  (5.11)  is  the  unique  viscosity 
solution  of  the  Hamilton- Jacobi  equation  (5.6).  Further,  if  K  C  Oo  and  Q  C  Mn 
are  compact,  then  there  exists  C  >  0  such  that 

sup  |  S(ar,<)[j7]  -  S(x,t)[(]  \  <  C  ||  rj  -  (  || 

(a;,i)eQx[0,T] 

for  all  rj,  (  £  K . 


Our  main  task  is  to  prove  that  Se  — >  S  as  e  — >  0  uniformly  on  compact  sub¬ 
sets.  From  this  the  asymptotic  formula  (5.2)  will  follow  (Theorem  5.3).  Before 
proceeding,  we  define  an  extension  of  the  deterministic  estimator  to  all  of  flo  as 
follows: 

For  Y  €  fio,  define 

W(x,t)[Y]  =  S(x,t)[Y]-y(t)h(x).  (5.12) 


If  FF(-,  t)  has  a  unique  minimum  for  each  0  <  t  <  T  and  each  Y  £  flo,  we  say  as 
before  that  the  deterministic  estimate 


is  well  defined.  Note  that  if  Y  £  Oq  1 ,  then  W[Y]  satisfies  the  HJB  equation 
(4.7).  This  defines  a  continuous  map 

7ft  :«o -  V(Rn)  (5.13) 

Yt  ^£(<)[y(] 


which  extends  (4.10). 


5.1.2  Some  Estimates 

Let  Se(x,t)  be  the  solution  of  (5.4).  In  this  section  we  obtain  estimates  for 
I  se  I,  I  DSt  I  and  for  the  Holder  norm  of  Sc  in  t  £  [0 ,T]  on  compact  subsets 
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independent  of  the  parameter  e.  These  estimates  will  be  used  in  Section  5.1.3 
to  prove  that  Se  — >  S. 

Lemma  5.1  For  every  compact  subset  Q  C  lRn  X  [0,  T],  there  exists  e0  >  0  and 
K  >  0  such  that  for  0  <  e  <  eo  we  have 

I  Sc(x,t)  |  <  K,  for  all  (x,t)  £  Q,  (5-14) 

|  DSe(x,t)  \  <  K,  for  all  (x,t)eQ.  (5.15) 

To  prove  (5.14),  we  use  a  comparison  theorem  which  depends  on  the  max¬ 
imum  principle  for  linear  parabolic  PDE.  Let  Br  C  Mn  denote  the  closed  ball 
centred  at  0  with  radius  R  >  0,  write  Tr  =  Br  x  {0}  U  8Br  x  [0,  T]  and  define 
Qr  —  Br  x  [0,  T],  denoting  by  Q%  its  interior. 

Lemma  (Maximum  Principle,  Friedman  [23])  Define 

Cw  =  — -Aw b  ■  Dwc, 

Li 

where  P  is  smooth.  If  Cw  <  0  (Cw  >  0)  in  Q°R,  then 

w(x,t)  <  sup  w(z,s)  inf  tn(z,s)  <  m(x,t) 

(*,s)  e  rB  e  r*  J 

for  all  (x,  t)  £  Qr. 

Lemma  5.2  (Comparison  Theorem)  Let  Se  be  a  solution  of  (5.4),  and  define 

Cv  =  §-tv  -  ^ An  +  g  ■  Dv  +  -  |  Dv  \2  -Vc. 

Let  w  —  v  —  Sc.  If  Cv  >  0  (Cv  <  0)  in  Q°R)  and  if  Se  <  v  (v  <  Se)  on  FR,  then 
Se  <  v  (v  <  Se)  in  Q°r. 
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Proof:  If  Cv  >  0,  then 

w  +  Dwg  +  \  (|  Dv  |2  -  |  DSe  |2)  >  0. 


dt  2 


Now  |  Dv  |2  -  |  DSe  \2=  Dw  (Dv  +  DSC)'.  Set 

¥  =  g  +  ±(Dv  +  DS‘). 

Then  Cw  >  0  and  on  T/j,  w(z,  s)  >  0.  Hence  w(x,t)  >  0  for  all  (x,t)  £  Qr  by 
the  maximum  principle.  I 

Proof  of  Lemma  5.1:  1.  We  now  construct  a  function  v  such  that  Cv  >  0 

in  Qr  and  Sc  <  v  on  Tr,  independent  of  (sufficiently  small)  e  >  0  (Evans-Ishii 
[16]).  Define 

v(x,t)  =  p,  \  _  +  lit  +  Af, 


R?-  |  x  |2 

where  the  constants  g  >  0,  M  >  0  are  to  be  chosen 
We  write  Vi  for  vXi ,  etc.  Then 


(5.16) 


Cv 


2  n 


/*“ o 


+ 


8  I  x 


+  E 


2  \(R 2-  |  *  |2)2  (i?2-  |  x  |2)3 

2  g%xi 


+ 


2  | 

x  |: 

2 

(R2- 

X 

I2)4 

V‘ 


>  g  —  tC 


1 


+ 


(i?2-  |  x  |2)2  (# 

2  I  x  I2 


X 

2 

(R2- 

X 

I2)3 

(i?2-  |  x  |2)4 
>  0  in 


-c 


for  all  small  e  >  0,  provided  g  is  chosen  sufficiently  large.  Choose  M  so  large 
that 

S'o(x)  <  M  for  all  x  6  Br. 

Now  v(x,t)  — >  oo  as  |  x  |— >  R  uniformly  in  t  6  [0,T],  hence 

Se  <  v  in  Q°r, 
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and  since  v  is  continuous  in  Q°R ,  there  is  a  constant  K  >  0  depending  on  R  such 
that 

Se(x,t )  <  K  for  all  (x,t)  e  Qr/ 2, 

for  all  sufficiently  small  e  >  0. 

Similarly  we  can  find  a  lower  bound  for  Se  on  QR/ 2. 

2.  Next  we  estimate  the  gradient,  using  a  variant  of  the  techniques  used  in  Evans 
and  Ishii  [16],  as  suggested  to  us  by  L.  C.  Evans.  To  simplify  the  notation  we 
write  v  —  5%  which  from  (5.4)  satisfies 

Vt  -  | va  +  \-ViVi  +  Vig1  -  Ve  =  0,  (5.17) 

where  we  have  used  the  summation  convention.  Let  Q  CC  Q'  CC  Mn  x  (0,  T), 
where  Q,  Q'  are  open  and  “CC”  means  “compactly  contained  in”.  Choose 
C  €  Cf(IRn  x  [0,  T])  such  that  0  <  (  <  1,  (  =  1  on  Q  and  (  —  0  near  dQ\  and 
define 

2T  =  (^VkVk  -  \v  (5.18) 

where  the  constant  A  >  1  is  to  be  chosen. 

Suppose  that  z  attains  its  maximum  over  Q'  at  (x0,to)  £  Q' •  Then  we  have 

Zi  —  0  and  (5.19) 

0  <  zt-lza  (5.20) 

at  the  point  (x0,t0).  Then  at  this  point,  using  (5.20), 

0  <  2 ((tVkVk  +  2 C2vkVkt  ~  Xvt 

-e(i(iVkVk  -  e(CiiVkvk  -  4 e((iVkvki 

-e(2vkivki  -  e(2VkVkii  + 
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k 


<  -eC(2  I  D2v  I2  +2CV  (vt  -  ^Vi^j 
+A  Vt  +  —  Vii'j  +  C  |  Dv  |2 

for  e  sufficiently  small.  Using  (5.17)  we  find  that 

0  <  -vk  (C2ViV^jk  -  g1  (C2vkvk).  +  ~ViVi  +  C(  \  Dv  |3 
+C  |  Dv  |2  +XC  |  Dv  |  +XC. 

This  together  with  (5.19)  implies 

^  |  Dv  |2  <  C(  |  Dv  |3  +C  |  Dv  |2  +AC  1  Dv  \  +XC.  (5.21) 

Let 

A  =  /i[max  ((  |  Dv  |)  +  1],  (5.22) 

where  g  >  1  is  to  be  chosen.  Then 

C  I  Dv  I3  <  I  Dv  I2  [max  ((  |  Dv  |)  +  1], 
and  thus  from  (5.21), 

£  |  Dv  |2  <  C  |  Dv  |2  +[max  ((  |  Dv  |)  +  l]"^  |  Dv  |2  +gC  \  Dv  \  +Cg 
<  C  I  Dv  I2  +gC[\  Dv  I  +1]  (5.23) 

using  the  fact  that  [max((  |  Dv  |)  +  1]_1  <  1.  Choosing  g  so  large  that  g/ 4  < 
g/2  —  C ,  we  have  from  (5.23) 

|  Dv  |2  <  C[\  Dv  |  +1]  at  (x0,to)- 

Then,  because  g  >  1  and  0  <  £  <  1,  at  (xo,to)  we  have 

z  =  (2  \  Dv  |2  —An 

<  CC'IC  I  Dv  |  +1]  +  A C 

<  AC.  (5.24) 
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This  implies 


z  <  CX  in  Q'. 

If  it  happened  that  (xq,  to)  £  dQ',  then 

2  =  —An  <  CX  at  (xo,to), 
and  this  also  implies  (5.25).  But  from  (5.25), 

max  |  Dv  |2)  <  maxz  +  (7A  <  C*A, 
and  using  the  definition  (5.22)  we  have 

max  |  Dv  |2)  <  (7^[max(C  |  Dv  |)  +  1] 


which  implies 


and  hence 


C|  Dv  |  <  C  in  Q', 

\Dv\<  C  in  Q. 


The  following  Holder  estimate  is  a  modification  of  Lemma  5.2 
and  Lions  [7]. 

Lemma  5.3  For  every  compact  subset  Q  C  2Rn,  there  exists  eo  >  0 
such  that  for  0  <  e  <  e0  we  have 

|  Se(x,t)-S‘(x,s)  \<K(y/i\t-s  |1/2+  \t-s\) 


(5.25) 


I 

of  Crandall 


and  K  >  0 

(5.26) 


for  all  x  €  Q]  t,s  G  [0,  T). 
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Proof:  Let  Q  CC  Q'  CC  lRn,  and  choose  £  6  C£°(IRn)  such  that  (  =  1  on 


Q  and  (  =  0  near  dQ1.  Choose  eo  >  0  such  that  dist(9<5,  dQ')  >  \Ze0T  and  so 
that  (5.14),  (5.15)  hold  on  Q  x  [0,T]  for  0  <  e  <  cq.  Write 


u  =  (Se. 

Then 

=  lc(-i5‘5‘-915‘  +  V‘)+C,S<-eC.S|-|Cii5<l 

<  Kq  on  lRn  x  [0,T]. 

Let  pa  be  a  standard  modifier  for  0  <  a:  <  y/toT,  and  set  va  =  pa  *  v.  Then 
as  in  [7],  for  0  <  t  <  T, 

II  vt  <  K0  +  e  ||  v %  ||L°°(fl") 

<  Kq  H - II  Dv  ||/,oo(Rn)  . 

a 

Next,  for  x  E  JRN ,  t,s  6  [0,T], 

|  v(x,t)  —  u(x,s)  |  <  |  v(x,t)  —  va(x,t)  |  +  |  va(x,t)  —  u"(a:,s)  | 

+  |  va(x,s)  -  v(x,s)  | 

<  olK  +  ^/'fo  4 - — ^  |  t  —  s  | 

<  K(e1'2\t-s\1/2+\t-s\), 

on  our  setting  a  =  e1/2  j  t  —  s  p/2. 

Finally  note  that  v  =  S€  on  Q.  i 


5.1.3  Asymptotic  Result 


We  first  present  an  asymptotic  result  for  the  robust  Zakai  equation  for  a  fixed 
observation  record. 
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Theorem  5.2  FixY  E  Q0.  Letpc,Sc,S  denote  the  corresponding  solutions  of 
(2.31),  (5.4),  (5.6).  Then  under  the  above  assumptions,  we  have 

—  lime  log pe(x,t)  —  limS'e(x,i)  =  S(x,t )  (5.27) 

uniformly  on  compact  subsets  of  lRn  x  [0,  Tj. 

Proof:  Lemmas  5.1  and  5.3  imply  that  S<L  are  uniformly  bounded  and  equicon- 
tinuous  on  compact  subsets.  By  the  Arzela-Ascoli  theorem,  there  is  a  sub¬ 
sequence  6k  — *  0  such  that  S£k  converges  uniformly  on  compact  subsets  to  a 
continuous  function  S.  By  the  “vanishing  viscosity”  theorem,  Crandall  and  Li¬ 
ons  [6],  S  is  a  viscosity  solution  of  (5.6).  By  uniqueness,  S  =  S.  In  fact,  Se  — >  S 
as  e  — >  0.  1 

Recall  that  the  solutions  Xe,Y£  of  (2.2),  (2.3)  converge  in  probability  uni¬ 
formly  on  [0,  T]  to  the  solution  of  the  corresponding  deterministic  system,  and 
Z  e  flo  denotes  the  limit  of  the  process  Ye.  We  write  5£[T]  for  the  solution  of 
(5.4)  for  Y  £  fi„,  and  similarly  5[K]. 

Let  q€(x,t)  denote  the  solution  of  the  Zakai  equation  (2.27).  Then  by  (2.29), 
(5.3): 

-dog  q’(x,t)  =  S‘[K‘](:M)-!/‘(()M*).  (5,28) 

As  in  (5.12),  define 

W(x,t)  =  S[Z](x,t)  —  z(t)h(x).  (5.29) 

Note  that  Z  is  continuously  differentiable  and  so  W  is  the  value  function  for 
deterministic  estimation  with  observation  record  Zt  =  {2(3),  0  <  3  <  t}  E  Oq  . 

Lemma  5.4  Fix  Z  E  O0,  and  let  Q  be  a  compact  subset  of  lRn.  Then  for  all 
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f3  >  0  there  exists  7  >  0,  eo  >  0  such  that  if  ||  7  —  Z  ||<  7,  0  <  e  <  e0,  we  have 

sup  |  *S,e[7](£, t)  —  Sc\Z\{x,t)  |<  f3.  (5.30) 

xeQ,  0 <t<T 

Proof:  Let  ||  •  ||g  denote  the  supremum  norm  on  Q  x  [0,  T],  Suppose  the 

assertion  is  false.  Then  there  exists  fl  >  0,  a  subsequence  tj  — »  0,  sequences 
7j  — >  0,  rjj  £  flo  with  ||  rjj  —  Z  ||<  7 j  such  that 

II  -  S‘‘[Z]  Ho  >  D  >  0.  (5.31) 

for  j  =  1,2,...  Note  that  rjj  — >  Z  uniformly  on  [0,  T]  as  j  — >  00. 

An  inspection  of  the  proofs  of  Lemmas  5.1,  5.3  reveals  that  the  sequence 
is  uniformly  bounded  and  equicontinuous  on  Q  x  [0,  T].  These  proofs 
did  not  require  differentiation  of  the  coefficients  with  respect  to  t,  and  so  the 
estimates  can  be  made  uniform  in  the  parameter  rj  over  a  compact  subset  in  f l0- 
Thus,  as  in  the  proof  of  Theorem  5.2,  there  is  a  subsequence  jk  such  that 

limS£M%J  S[Z]  =  lim  Stik[Z). 

k—Kyo  k—>  00 

Letting  j  =  jk  — »  00  in  (5.31)  yields  a  contradiction.  1 

Corollary  5.1  Let  K  C  Llo  and  Q  C  lRn  be  compact.  Then  for  all  /3  >  0  there 
exists  eo  >  0  such  that  if  0  <  t  <  eo  then 

sup  |  5£(x,f)[7/]-5'(x,f)[7?]  |  <  /3 
(x,t)£Qx[0,T] 

for  all  rj  £  I(. 

Proof:  Using  the  fact  that  A[?/]  depends  continuously  on  7/  G  Q0,  we  can  argue 
by  contradiction  as  above.  I 
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Theorem  5.3  Under  the  above  assumptions ,  we  have: 

lime  log  qe(x,t)  =  —W(x,t)  (5.32) 

in  probability  uniformly  on  compact  subsets  of  JRn  x  [0,T]. 

Proof:  Since  Y£  — >  Z  in  probability,  in  view  of  (5.28)  it  is  enough  to  show  that 
p(Sc[Ye],  <SI[Z])  — >  0  in  probability,  where  p  denotes  a  metric  on  C(lRn  x  [0,  T];  1R ) 
corresponding  to  uniform  convergence  on  compact  subsets. 

Fix  8  >  0,  /3  >  0.  From  Lemma  5.4,  we  can  choose  7  >  0,  ei  >  0  such  that 

l,(S‘[Y],S‘[Z\)  <  0/2  (5.33) 

for  all  0  <  e  <  ei  and  all  ||  Y  —  Z  ||<  7. 

Choose  e2  >  0  such  that 

P  (\\Y‘ -  Z  \\>  ■,)  <  S  (5.34) 

for  all  0  <  e  <  e2.  From  Theorem  5.2,  choose  63  >  0  such  that 

,.(S'[Z],S[Z])  <  13/2  (5.35) 

for  all  0  <  e  <  63.  Set  0  <  e0  <  A  e2  A  e3. 

Then  if  0  <  e  <  e0, 

P{p(S*[Y%S{Z})>ft)  <  P(p(S£[Y%S‘{Z})>  /!/ 2;  ||  -  Z  ||>  7) 

<  8. 

I 

5.2  Probabilistic  Method 

Once  again  we  are  interested  in  the  filtering  problem  (2.2),  (2.3),  and  we  make 
the  same  assumptions  as  above. 
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Let  cr)  be  the  solution  of  the  weak  form  of  the  Zakai  equation  (2.26).  Com¬ 
plementing  formula  (5.2),  we  prove  in  this  section 

aet(A)  x  e“iinLe/i w(x,t)  (5.36) 

in  probability  as  e  — *  0,  where  W(x,t)  is  defined  by  (5.29).  The  proof  uses  the 
representation  formula  (2.28),  rather  than  the  results  of  the  previous  section. 


5.2.1  Varadhan’s  Theorem  “in  Probability” 

In  order  to  use  the  representation  formula  (2.28)  to  prove  (5.36),  we  need  an 
extension  of  Varadhan’s  Theorem  to  random  functions  Fe.  Recall  the  notation 
and  terminology  used  in  Section  2.2.  We  follow  the  development  in  Varadhan 

[55]. 


Theorem  5.4  Let  {P£}  obey  the  LDP  with  action  function  I .  Let  Fc  :  X  x  D  — > 
1R  be  a  family  of  random  functions  such  that  for  each  u>  £  Ll,  Fe(-,  u>)  :  X  — >  1R 
is  continuous.  Let  F  :  X  — >  1R  be  bounded  and  continuous.  Assume  that  for  all 
a  >  0,  /3  >  0  there  exists  ex  >  0  such  that  0  <  e  <  ex  implies 

P  (sup  |  Fe(x)  -  F(x)  |<  a  )  >1-/3.  (5.37) 

VxeX  J 

Then  if  C  C  X  is  closed,  G  C  X  is  open,  we  have:  for  all  7  >  0,  6  >  0  there 
exists  e0  >  0  such  that  0  <  e  <  e0  implies 


P 

P 


/  exp 

t 

I  i- 

Jc 

L  e  j 

r 

r  1  i 

/  exp 
Jg 

-7F‘(x) 

-  inf  [/(*)  +  F(x)]  +  6) 
-inf[/(x)  +  F(x)]-«) 


>1-7,  (5-38) 
>1-7.  (5.39) 
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To  prove  this  theorem,  we  use  the  following  lemma.  This  lemma  is  more 


general  than  required. 


Lemma  5.5  Let  Fc  be  a  family  of  random  functions  on  X.  Let  F  be  a  lower 
semicontinuous  function  onX.  Assume: 

(i)  ///(x)  <  oo,  then  there  exists  e'  >  0,  r  >  0  such  that  if  Nx  =  {y  6  X  : 
c/(x,  y)  <  r}  and  0  <  e  <  e'  then 

P  (  sup  (~Ft(  y))  <  —F(x)  -fa]  >  1  —  0.  (5.40) 

Vye^x  J 

(ii)  For  all  p  >  0  there  is  an  L  >  0,  e1'  >  0  such  that  0  <  e  <  e1'  implies 


P  [  inf  Fc(x)  >  -L  )  >  1-/9. 


(5.41) 


Then  for  all  7  >  0,  6  >  0  there  exists  eo  >  0  such  that  0  <  e  <  eo  implies 


log/x 


P  (  e  log  j ^  exp 


r  1 


Fe(x)  Pe(dx)  <  -  inf [I(x)  +  F(x)]  +  6)  >1-7.  (5.42) 
e  J  xeX 


Proof:  Choose  7>0,  ^>0;  0<p<  7/2.  From  hypothesis  (ii),  let  L ,  t"  be 
such  that  (5.41)  holds.  Write 

A  =  mf[/(x)  +  F(x)]. 
xgX 

Choose  M  >  0  such  that  L  —  M  <  —A,  and  let  K  =  {x  G  X  :  J(x)  <  M}.  Let 
{xij^i  be  a  dense  subset  of  K.  Set  a  =  6/ 5,  /?  =  7/2. 

By  hypothesis  (i),  for  each  i  —  1,2,.. .,  there  exists  e',  ?y,  TV/  =  {y  :  d(x/,  y)  < 
r4},  such  that  if  0  <  e  <  then 

P{Aht)  >  1-0/2’' 
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for 


AitC  =  <  sup(-F£(y))  <  -F(x.i)  +  a\  . 

[y  eNi  ) 

Since  I  is  lower  semicontinuous,  reducing  r,-  if  necessary,  if  y  €  Ni  then 

/( y)  >I(xi)-a. 

Let  Gi  —  {y  :  d(x,-,  y)  <  r,-/ 2}.  Since  {x,-}  is  dense  in  JL,  and  since  K  is 
compact,  K  C  Ujli  Gi  =  G  for  some  positive  integer  iV.  Fix  i  6  {1, . . . ,  At}. 

Now  {P£}  obey  the  LDP  with  action  function  /,  so  there  is  an  0  <  e*  <  e'- 
such  that  0  <  e  <  e;  implies 

elogP t(Gi)  <  —  inf  /(y)  +  a 

y  eGi 

<  -/(xj)  +  2a. 

Then  on  A;>e,  if  0  <  e  <  e,-, 

Qe(Gi)  =  /  exp  f — — J7'e(x)l  Pe(dx) 

<  exp[-iyinf y‘(y)]p'(ft) 

<  exp  —  a)  Pf(G;), 

and  hence 

elog  Qe(Gi)  <  -F(xj)  +  a  +  elog  P£(Gi) 

<  —A  +  3a. 

Let  Ak,c  =  Dili  Ai,e.  Then  there  exists  tK  >  0  such  that  if  0  <  e  <  eK,  then 

P(Ak,<)  >  l-EPI?  >  1-7/2. 

i— 1 

and  elog  N  <  a.  So  on  AkjC,  if  0  <  e  <  e#, 

elog  Qe(G)  <  e log  Ar  +  max  elog  Q£(Gi) 

<  — A  +  4a. 
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Let 


Bk,c  —  Ai<,t  fl  (  inf  Fe(x)  >  —  L 
IxeX 

Then  there  exists  e0  >  0  such  that  0  <  e  <  e0  implies 

P(BKtC)  >  1-7/2  -p 

>  1-7, 


and  elog2  <  a.  Thus  on  Bj< )£,  if  0  <  e  <  e0, 

elogQ6(X)  <  clog 2  +  max  (dog  Qe(G),  elogQc(Gc)) 

<  a  +  max  (—A  +  4a,  L  —  M ) 

<  —A  +  5a. 


Consequently 


P  (elogQ£(X)  <  —A  +  8)  >  P(BKit) 

>  1-7 


as  required.  §j 

Proof  of  Theorem  5.4:  Upper  bound.  Let  C  C  X  be  closed.  Define  Fe,  F  to 
equal  Fc,  F  on  C  respectively,  and  to  equal  +00  off  C.  Then  F€,  F  are  lower 
semicontinuous  functions. 

Let  x  €  C.  Then  (5.37)  and  the  continuity  of  F  on  C  imply  (5.40).  If  x  ^  C, 

(5.40)  is  obvious.  Since  Fe  >  Fc  on  X,  (5.37)  and  the  boundedness  of  F  imply 

(5.41) .  Then  applying  Lemma  5.5  we  obtain  (5.38). 

Lower  bound.  Let  GcXbe  open.  Assume 


A g  =  inf  [/(x)  +  -F(x)]  <  00. 
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Set  a  =  8/3,  /3  —  7.  Choose  z  6  G  such  that 

/( z)  +  F(z)  <  Ag  +  a. 

Our  hypotheses  imply  there  exist  eo  >  0,  r  >  0  such  that  if  Nz  =  {y  : 
d(z,  y)  <  r}  and  0  <  e  <  eo,  then 

P(Az,£)  >  1-/9 

for 

Az,e  =  (  sup  (-F£(y))  >  -F(z)  -  a]  . 

lyewz  J 

Reducing  e0  if  necessary, 

elog  Pe(Nz)  >  -MI(x)-a 
>  — 7(z)  —  2a 

provided  0  <  e  <  e0. 

Then  on  Az<e,  if  0  <  e  <  e0, 

QC(G)  =  /  exp  f--F£(x)l  Pe(dx) 

>  exp  j^-^(F(z)  +  a)  Pe(Nz), 

and  hence 

elog  Qe{G)  >  -F(z)  -  a  +  elogPE(Arz) 

>  -[J(z)  +  F(z)]-2a 

P  — Ag  —  3a. 


Hence 


as  required. 


P  (e  log  Cr(G')  >  —  Ag  —  8)  >  P(Az,e) 

>  1  —  7 


I 
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5.2.2  Asymptotic  Result 

Recall  that  =  {y\  :  0  <  s  <  T}  converges  in  probability  to  Z  =  {zs  : 
T}.  Fix  0  <  t  <  T,  define  the  random  functions 

Fe(e,u)  =  X-  J*  h(0s)2ds  -  J*  h(Oa)dyl(u>),  ( 9  G  17",  oj  €  17) 

and  the  function 

F(0)  =  l  fh(9s)2ds-  fh(6s)zsds. 

I  Jo  Jo 


Lemma  5.6  For  all  a  >  0,  (3  >  0  there  exists  ei  >  0  such  that  0 
implies 

P  (  sup  |  Fc{0)  -  F(0)  |>  a)  <  j3. 

\oeQn  ) 


Proof:  From  (2.3), 

/  h(0s)dyl  =  f  h(9s)h(xcs)ds  +  y/e  f  h(0s)dvs. 
Jo  Jo  Jo 

Then  using  the  Chebeshev  inequality, 

P  (sup0Gn„  |  fo  h(0s)h(xes)ds  -  h(0s)zsds  \>  a ) 

<  ^EsxvpdeUn  I  Jo  h(0s)[h(xes)  -  h(xs)]ds  |2 
<  &  II  h  ||2  Up{h)2TeC, 

where  the  last  inequality  follows  from  Lemma  2.1. 

Again  using  the  Chebeshev  inequality, 

P  (  sup  yfe  |  /  h(0s)dvs  \>  a )  <  —  E  sup  |  f  h(9s)dvs  |2 
\eecin  Jo  )  ol  eenn  Jo 


0  <  s  < 

(5.43) 

(5.44) 

.  e  ^  6\ 

(5.45) 


(5.46) 
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Choose  a  sequence  of  random  variables  0l  so  that 


sup  |  [  h(Qs)dvs  | 2  =  lim  |  /  h(Q\)dvs  |2  <  oo  a.s. 

Then  the  RHS  of  (5.46)  is  bounded  by 

—  lim inf  E  f  h(6l)2ds  <  ~T  ||  h  ||2  . 
cr  »-+ oo  Jo  a1 


I 


Let  W (x,  t)  be  Mortensen’s  value  function  for  the  observation  path  Z,  defined 
by  (5.29). 


Theorem  5.5  Under  the  above  assumptions,  for  each  closed  set  C  C  JRn  and 
each  open  set  G  C  IRn,  we  have: 


lim  sup  clog  or^(C) 

e-*0 

lim  inf  t  log  <rl(G) 


< 

> 


—  inf  W(x,t)  in  probability, 

x£C 

—  \niW(x,t)  in  probability. 

x€-G 


(5.47) 

(5.48) 


Proof:  Let  denote  the  cannonical  projection  0"  — »  IRn,  for  0  <  t  <  T.  Let 
A  £  B{IRn).  Then  from  (2.8) 


=  v(PrM) 

=  /  exp  P‘xm-  (5.49) 

jpt  A  L  e  J 

Recall  that  {P% }  obey  the  LDP  with  action  function  lx  defined  by  (2.38).  Now 


inf  [7x(9)  +  F(«)l  =  ini [inf  [!*(»)  +  F(9)] 
9&pUa  xeAe-.et=x 

=  inf  W(x,t). 

Then  apply  Theorem  5.4  to  (5.49)  for  the  sets  p p i^G. 


I 
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5.3  Large  Deviations 


We  have  seen  that  the  optimal  control  problem  associated  with  deterministic 
minimum  energy  estimation  plays  a  key  role  in  studying  the  asymptotics  of 
the  Zakai  equation.  In  this  section  we  shall  see  that  this  control  problem  is 
exactly  the  variational  problem  governing  a  large  deviation  principle  for  certain 
conditional  measures.  In  addition  we  study  the  weak  limit  of  the  conditional 
measures,  and  identify  the  limiting  filter. 

5.3.1  Hijab’s  Result 

We  present  Hijab’s  [26]  “pathwise”  LDP  for  certain  conditional  measures,  em¬ 
ploying  our  notation,  etc.  Hijab  proved  his  result  for  the  case  that  (2.2)  has 
fixed  initial  conditions  xe0  —  xq,  so  the  result  we  state  is  a  slight  extension  of  his; 
here  Xq  is  random  with  density  Pq{x)  defined  by  (2.1). 

Recall  the  measures  He,  IP  defined  by  (2.17),  (2.18)  respectively,  parame- 
terised  by  77  E  O0.  For  each  rj  E  Do,  the  measures  {P{n]}  obey  the  LDP  with 
action  function 

I{9)[rj]  =  inf  |s'o(x„(0))  +  7>fo  I  «(*)  I2  dt  :  xu  —  0  j  (5.50) 

where  here  xu  denotes  the  solution  of  (5.9)  with  initial  condition  s„(0)  =  0(0). 
Define 

F€{0,r,)  =  -riTh(6T)+  F  Ve{e„ri,)ds  (0  E  Dn,  p  £  fi0)  (5.51) 

Jo 

F(B,V)  =  ~r,Th{eT)+  [T  V{0s,Vs)ds  (5.52) 

Jo 

where  Vt  is  defined  by  (2.13)  and 

V(x,  y)  -  ^ h(x )2  +  yDh(x)f(x)  -  \y 2  |  Dh(x)  |2  (x  E  JRn,  y  E  JR).  (5.53) 
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Next  define 


J((%]  =  /(»)[!)] +  F(#,i))  (5.54) 

=  inf  j.So(z.<(0))  +  ji  i  |  u(t)  |!  +V(x,(t),Vl)dt  -  r,Th(x,(T))  :  x„  =  flj  ; 

compare  with  (5.10),  (5.11).  If  rj  €  this  expression  reduces  to 

inf  |s'o(a:u(0))  +  ^  \  u(t)  |2  +  ^h(xu(t))2  -  jth(xu(t))dt  :  xu  =  flj  ; 

where  xu  is  the  solution  of  (2.36).  Compare  with  (4.3),  (4.6),  (4.11).  Write 

J(0)[r]]  =  J(0)[r]}-  MJ(9')[t]}.  (5.55) 


Theorem  (Hijab  [26])  For  each  rj  £  D0,  {ne[?/]}  obey  the  LDP  with  action 
function  J(0)[rj]. 


Proof:  Apply  Varadhan’s  Theorem  to 


£(A)M  =  f 

J  A 


exp 


-F%O,rj)\P[v](d0)  (A£B(!f)). 


5.3.2  A  LDP  “in  Probability”  for  Conditional  Measures 

We  now  prove  large  deviation  results  for  the  conditional  measures  IIe,  defined 
by  (2.7),  (2.9)  respectively.  Recall  Definition  2.1.  Let  Fc,  F  be  given  by  (5.43), 
(5.44)  respectively  and  let  lx  be  the  action  function  for  {Px}  defined  by  (2.38). 
Define 

m  =  Ix(0)  +  F(6)  (5.56) 

=  inf  |  S"o(:eu(0))  +  ^  |  u(t)  I2  +^h(xu(t))2  -  zth(xu(t))dt  :  xu  = 
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where  xu  is  the  solution  of  (2.36).  Set 


m  =  (5.57) 

=  mw 

If  6  £  0"  is  absolutely  continuous,  then 

J(0)  =  So(0o)  +  ^  [T  |  0a  -  f(0t)  |2  +(zs  -  h(6s))2ds. 

Z  Jo 

Theorem  5.6  {IIe}  obey  the  LDPP  with  action  function  J(6)- 

Proof:  First  note  that  the  upper  and  lower  bounds  (iii),  (iv)  of  Definition  2.1 
follow  as  in  the  proof  of  Theorem  5.5  using  Theorem  5.4. 

By  definition,  J  >  0  and  is  lower  semi  continuous.  Choose  L  >  0  such  that 
F  >  -L.  Then 

{0  :  j(0)  <M}c{6:  I{0)  <  M  +  L] 

which  is  compact.  This  proves  (i)  and  (ii)  of  Definition  2.1.  I 

For  each  0  <  t  <  T,  define 

Jt(x)  —  W(x,t)—  mf^nW(x',t),  (5.58) 

=  inf  { Jt{0)  :  Qt  —  x), 

where  Jt(0)  is  defined  by  (5.57)  with  T  replaced  by  t. 

Corollary  5.2  {7r*}  obey  the  LDPP  with  action  function  Jt(x). 
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5.3.3  Limiting  Measure  and  Filter 

Recall  that  the  solution  X  =  {x(t)  :  0  <  t  <  T}  of  (2.35)  with  initial  condition  x0 
is  the  limit  in  probability  of  the  solution  of  (2.2),  and  that  Z  =  { z(t )  :  0  <  t  <  T] 
given  by  (5.1)  is  the  limit  in  probability  of  the  solution  of  (2.3).  W(x,t)  is  the 
corresponding  value  function  defined  by  (5.29),  and  Jt(x)  is  defined  by  (5.58). 


Lemma  5.7  For  each  0  <  t  <  T,  x{t )  is  the  unique  minimiser  ofW(-,t).  That 
is,  Jt(x(t))  =  0,  x(t)  =  x(t),  and  for  each  a  >  0 


inf  Jt(x)  >  0 

x:\x- a;(t)j>a 


(0  <  t  <  T ). 


Proof:  Clearly  Jt(x )  >  0  for  all  x  £  IRn.  Suppose  Jt(x*)  =  0  for  some  x*  £  IRn. 
Then  since  {0  :  0t  —  x*}  is  closed,  there  exists  0*  £  Q”  such  that  Jt(0*)  =  0  and 
6*  =  x*.  Then  6 J  =  xo  and 

o;  =  m),  ZS  =  h(0*s),  a.e.  0  <8<t. 

Now  X  £  fi"  is  the  unique  solution  of  (2.35),  hence  9 *  =  X.  Consequently, 
x*  —  x(t). 

Now  suppose  that  there  exists  a  >  0  such  that 


inf  Jt(x)  =  0. 


Let  {xjt}^L1  be  a  sequence  such  that 


lim  Jt(xk )  =  0. 

fc— o 

Since  Jt  is  an  action  function,  we  can  assume  Xk  — >  x*  for  some  x*  £  JRn  and 
Jt(x*)  =  0.  Then  |  x*  —  x(t)  |>  a  >  0  and  the  above  argument  shows  x*  =  x(t); 
a  contradiction.  This  completes  the  proof.  H 
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The  proof  of  the  following  theorem  uses  an  extension  of  results  in  Varadhan 

[55]. 

Theorem  5.7  Under  the  above  assumptions,  for  each  0  <  t  <  T ,  we  have: 

*t  =>  ^(t)  (5.59) 

as  e  —¥  0. 


Proof:  Let  0  <  t  <  T.  We  must  show  that  for  each  a  >  0,  (3  >  0  there  exists 
Co  >  0  such  that  0  <  e  <  eo  implies 

P  f>x(t))  >  a)  <  P-  (5.60) 

Define 

Ba  —  {a;  €  lRn  :|  x  —  x(t)  |>  a]  . 

By  hypothesis, 

inf  { Jt{x )}  =  7  >  0. 

Since  }  obey  the  LDPP,  there  exists  ex  >  0  such  that  0  <  e  <  ei  implies 

P(A)  >1-0 


for 

Ae  =  |elog7rt£(C)  <  -inf  +  ||  • 
Choose  0  <  eo  <  ei  such  that 

_  SL  . 

e  <  a 

provided  0  <  e  <  eo. 

Let  C  C  lRn  be  closed.  Then  on  Ae,  if  0  <  e  <  eo, 

<(Ba)  <  e-ft, 
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hence 


<{C)  <  6x(t)(Ca)  +  a, 

since:  if  C  C  Ba ,  then  6x(t)(Ca )  =  0  and  t^(C)  <  7^(67“)  <  a\  if  C  Ba,  then 
^t)(Ca)  =  1. 

Consequently,  (5.60)  holds.  | 

We  turn  now  to  the  pathwise  filters.  Recall  the  maps 

where  is  the  pathwise  filter  (2.20),  (2.21)  and  7t4  is  the  deterministic  estimator 
(5.13);  assumed  well  defined. 

Theorem  5.8  Under  the  above  assumptions ,  we  have 

KVi]  =>  *t[v\  (5-61) 

as  e  — >  0  uniformly  on  compact  subsets  of  Clo- 

Proof:  1.  From  the  results  of  earlier  sections,  we  see  that  for  each  p  G  D0, 

{^[77]}  obey  the  LDP  with  action  function 

Mx)lv\  =  W(M)M  -  W(x(t),t)[rj}.  (5.62) 

In  fact,  we  claim  the  following:  Let  K  C  Do  be  compact,  and  C  C  IRn  closed. 
For  all  fd  >  0,  there  exists  ex  >  0  such  that 

t\oga*(CM  <  -  inf  W(x,t)[V]  +  P  (5.63) 

for  all  0  <  e  <  ej,  and  all  ?]  £  K.  Here,  cr(e  is  the  unnormalised  measure  defined 

by  (2.19). 
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Suppose  that  (5.63)  is  false.  Then  there  exists  f3  >  0,  and  sequences  et-  — » 
0,  {r/i}  C  K  such  that 

e;  log  &¥ ( C)[r)i ]  >  -  inf  W ( x , t)[rn]  +  j3 

x£C 

for  all  i  —  1,2,...  Since  K  is  compact,  we  may  assume  that  rji  — >  fj  6  K  as 
i  — >  oo.  This  implies 


lim inf  e;  log  aj'(C)[rji]  >  -  inf  W(x,t )[?/]  +  /?. 

i— too  x&C 


(5.64) 


Now  employing  the  notation  used  in  the  proof  of  Proposition  2.1, 


VMC) 


< 

< 


El.fecexP  ()-G(V*‘,>>,)) 

Oft*”’*))  (^(X'Svi  -  •?)) 

(e1,., 6C exp  jj)))2  (iJexp  -  fj )) 


1 

2 


exp 


' M  ||  rji  —  rj 


1 

2 


The  last  inequality  uses  the  arguement  from  part  2  of  the  proof  of  Proposition 
2.1.  Hence 

limsup  log  <?l'{C)[rii]  <  -  inf  W(x,t)[fj]. 

i-HX>  X^C 

Combining  this  last  inequality  with  (5.64)  we  deduce  f3  <  0,  a  contradiction. 
This  proves  (5.63). 

2.  We  next  claim  that  for  all  a  >  0,  there  exists  7  >  0  such  that 


inf  Mx)[v]  >  7  >  0. 

ri£K,  x:\x- £(4)[r)]>a 


(5.65) 


Suppose  not.  Then  there  exists  a  >  0,  7 i  — »  0,  r/t-  — >  7/  €  K  and  { X{ }  C  IRn 

with 

|  x{  -  x(t)[j]i]  |  >  a 
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and 


0  <  Jt(xi)[r}i\  <  7f.  (5.66) 

From  Proposition  2.2,  it  follows  that  there  exists  C'  >  0  such  that 

W(x,t)[rj\  >  C'  |  x  |  —  C' 

for  all  (x,t)  e  IRn  x  [0 ,  T],  rj  €  K.  Now  r)i  ->  fj  implies  -»  x(t)[^],  so 

there  exists  M  >  0  such  that 


C'\xi\-C'  <  W(Xi,t)[Vi\  <  M, 


which  implies 


Therefore  we  can  assume  xt 


Xi  |  < 


M  +  C 
C' 


xe  JR"  with 


|  x  —  x(t)[rj]  |  >  a. 

Sending  i  — >  oo  in  (5.66)  we  deduce 

W(x,t)[r}\  =  W(x(t),t)[Tj\. 

Since  the  deterministic  estimator  is  well  defined,  this  equality  forces  x  =  x(t)[fj], 
a  contradiction.  This  proves  (5.65). 

3.  To  prove  (5.61),  we  must  show  that  for  all  6  >  0  there  exists  eo  >  0  such  that 

(5-67) 

for  all  0  <  e  <  e0  and  all  ?)£/(;  where  ^  is  the  Prohorov  metric  on  V(lRn). 

Let  6  >  0  and  define 


Bs  =  {x  :|  x  -  x(t)[r)]  |>  5}  . 
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From  (5.63),  (5.65),  there  exists  ei  >  0,7  >  0  such  that 

dog ^t(Bs)[v]  <  -  inf  Jt(x)[v\  +  7:, 

xe&s  z 

and  hence 

for  all  0  <  e  <  Ci  and  all  rj  £  K.  Choose  eo  >  0  such  that 

e~£  <  8. 


Then  we  deduce 

*t(c)[v]  <  Kt(Cs)[T]}  +  8 

for  all  0  <  e  <  e0,  V  £  K.  Hence  (5.67)  follows. 


I 
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Chapter  6 
Observer  Designs 

In  this  chapter  we  present  observer  designs  for  the  nonlinear  control  system 
x  =  f(x,u),  a;(0)  =  s0,  (6.1) 

y  =  Kx ) 

where  x  E  IRn ,  u  E  lRm ,  |tqj  <1  i  =  1, . . .  ,m  and  y  E  1RP .  The  initial 
condition  xq  is  unknown. 

We  first  consider  a  finite  dimensional  observer,  and  prove  the  following  re¬ 
sult  for  this  design:  provided  that  we  have  some  knowledge  of  x0  (in  the  form 
|x0  —  mo|  <  p,  where  m0  is  the  initial  condition  of  the  observer)  and  assuming 
that  (6.1)  satisfies  a  detectability  condition,  then  the  observer  trajectory  m(t) 
converges  exponentially  to  the  system  trajectory  x(t)  as  t  — >  oo.  The  radius 
of  convergence  p  depends  on  the  nonlinearities  in  the  dynamics  and  observa¬ 
tions  as  well  as  on  certain  design  parameters.  For  a  certain  class  of  systems,  no 
knowledge  of  x0  is  required. 

This  design  is  a  development  of  the  design  given  in  Baras,  Bensoussan  and 
James  [2],  a  paper  which  treats  systems  with  uncontrolled  nonlinear  dynamics 
and  linear  observations.  The  main  contributions  here  are  the  results  for  nonlinear 
observations  an d  controlled  dynamics.  We  remark  that  these  designs  do  not 
involve  coordinate  transformations,  canonical  forms,  local  linearization,  etc.,  and 
seem  robust  when  compared  with  other  designs.  However,  the  designs  do  involve 
solving  Riccati  equations  and  computing  certain  matrices  and  constants. 

We  also  give  an  infinite  dimensional  observer  design  based  on  the  deter¬ 
ministic  estimator.  Assuming  that  (6.1)  satisfies  an  observability  condition,  the 
observer  computes  exactly  the  unknown  state  after  a  finite  time  has  elapsed. 
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6.1  An  Observer  Design 


We  assume  that  /,  h  are  smooth  with  bounded  derivatives  of  orders  1  and  2. 
Assume  t  u(t)  is  continuous.  Let  N  £  L(]Rn ,  Mn),  R  £  L(1RP ,  IRP)  and 
assume  rank  N  =  n  and  R  >  0. 

Write  A(x,u)  =  Df(x,u),  H(x )  =  R~^Dh{ a;),  where  D  denotes  gradient  in 
the  x  variable.  Set 

||A||  =  sup{||A(z,u)||  :  x  £  Mn,  \ui\  <  1} 

and  similarly  define  \\H\\,  and  so  on. 

Consider  the  coupled  system 

rh(t)  -  /(m(t),u(t))  +  g(t)/f(m(t))'JR-1(y(t)-h(m(t)))  (6.2) 

m(0)  =  mo 

Q(t)  =  A(m(t),u(t))Q(t)  +  Q(t)A(m(t),u(t)Y 

-Q{t)H(m{t))'H(m(t))Q(t)  +  NN'  (6.3) 

Q(0)  =  g0>o. 

This  is  our  finite  dimensional  observer  for  (6.1).  It  is  essentially  a  modifica¬ 
tion  of  the  deterministic  or  minimum  energy  estimator,  as  discussed  in  Baras, 
Bensoussan  and  James  [2],  In  Section  4.4,  this  system  was  interpreted  as  an  ap¬ 
proximate  deterministic  minimum  energy  estimator.  Note  in  particular  that  the 
Riccati  differential  equation  (6.3)  depends  on  the  control.  This  is  not  necessary 
when  f(x,u )  =  f{x)  +  Bu  :  set  A{x)  =  Df(x). 

We  will  assume  that  the  system  (6.1)  is  uniformly  detectable,  or  uniformly  of 
full  rank ;  see  Section  3.2.  Since  N  has  rank  n  and  ||  A||  <  oo,  the  pair  (A(x,  u ),  N ) 
is  uniformly  stabilisable  (refer  to  Section  6.2.1),  and  NN'  >  r0I  for  some  ?’o  >  0. 
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Let  PQ  —  Qq1,  P(t )  =  <5(i)-1,  and  P,Q  be  the  bounds  for  ||-P(t)||,  ||<5(£)|| 
(given  in  Section  6.2.2). 

Regard  Ao,N,R  as  design  parameters.  Define  p  =  p(Q0,  N,  R)  by 

P  =  ~~PW7.  (v^lPVll  +  vAII*-1 1|2  ||M||  ||£i2/.||) "  (6.4) 

Q' Iko  I! 

Our  main  theorem  is  the  following  convergence  result,  similar  to  Theorem  8  in 

[2]- 

Theorem  6.1  Assume  there  exist  Qo,N,R  such  that 

|z0  -  rao|  <  p{Qo,  N,  R).  (6.5) 

Then  the  system  (6.2),  (6.3)  is  an  observer  for  the  nonlinear  control  system  (6.1) 
provided  that  (6.1)  is  uniformly  detectable  or  uniformly  of  full  rank ,  and  the 
above  assumptions  hold.  That  is,  there  exist  contants  K  >  0, 7  >  0  such  that 

\x(t)  -  m(i)|  <  K\xo  —  m0|e_7t  (6.6) 

for  all  t  >  0. 

Remark  There  is  a  trade-off  between  the  decay  rate  7  =  7(^0,  N,  R)  and  the 
radius  of  convergence  p.  The  designer  can  optimize  his  choice  of  7,  p  by  varying 
the  design  parameters.  /// 

By  using  different  estimates  for  the  nonlinearities,  we  obtain  an  observer  for 
(6.1)  without  any  contraints  on  the  initial  conditions  Xo,  mo  for  a  class  of  systems. 
Included  in  this  class  are  systems  for  which  A(x,  u)  is  uniformly  negative  definite 
(see  the  example  in  Section  6.4.2). 

Define  8  —  8(Q0,N,R)  by 

6  =  ^  -4pII-d/II  -4|i.fr'||2||D/!||2.  (6.7) 
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If  D2  f  or  D2h  is  zero,  we  omit  the  corresponding  term  from  (6.7). 

Corollary  6.1  Assume  there  exist  Qo,N,R  such  that 

0<6(QOyN,R).  (6.8) 

Then  the  system  (6.2),  (6.3)  is  an  observer  for  the  control  system  (6.1)  pro¬ 
vided  that  (6.1)  is  uniformly  detectable  or  uniformly  of  full  rank,  and  the  above 
assumptions  hold.  That  is,  there  exist  contants  K  >  0, 7  >  0  such  that 

|x(i)  —  m(t) |  <  I(\xo  —  m0 |e_7i  (6-9) 

for  all  t  >  0  and  all  x0,  m0  6  lRn. 

Remark  Our  design  can  readily  be  extended  to  time  varying  systems.  One  has 
to  modify  appropriately  the  detectability  conditions.  /// 


6.2  Riccati  Equations 

Write  X  =  lRn  x  [—1,  l]m  and  £  =  (x,  u)  €  X.  If  t  1— *  =  (xt,  ut )  is  a  continuous 
curve,  we  write  At  —  A(ft)  =  A(xt,ut),  etc. 

Consider  the  Riccati  differential  equations 

Qt  =  AtQt  +  QtA't  -  QtH[HtQt  +  NN',  (6.10) 

Pt  =  —PtAt  —  A'tPt  —  PtNN'Pt  +  H'tHt,  (6.11) 

Qo  =  Po1  >  0. 

Existence  and  uniqueness  for  (6.10),  (6.11)  are  standard,  and  note  that  Pt  —  QT1- 
In  this  section,  we  obtain  uniform  bounds  for  the  solutions  of  these  Riccati 
equations. 
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6.2.1  Uniform  Detectability  and  Stabilisability 

The  bound  for  ||  Qt  ||  requires  a  detectability  condition,  such  as  uniform  de¬ 
tectability  or  uniform  full  rank. 

To  obtain  a  uniform  bound  for  ||  Pt  ||,  we  assume  that  rank#  =  n  and  use  the 
following  uniform  stabilisation  result,  based  on  Kalman  [35].  Let  $f(Mo)  denote 
the  fundamental  transition  matrix  corresponding  to  a  time  varying  matrix  Ft. 
Recall  NN'  >  r0I. 

Lemma  6.1  Assume  rankN  —  n.  Consider  the  control  system 

zt  =  -Atzt  -  Nut,  ^(O)  =  z,  (6.12) 

where  At  —  A(£t)  for  some  curve  t  1— >  Then  there  exists  a  feedback  control 

u°  =  Ttzt  such  that 

II  0)  ||<  \Jjoex P  (“2^^  “  to^j  ’  (6-13) 

for  t  >  to  >  0,  where  At  =  —  At  —  NTt  and  for  any  a  >  0, 

f30(a)  =  <7e-2(dWI  (l  +  ||  N  ||2)_1  , 

&(<r)  =  cr e4ordj4ll  ^  , 

II  rt  ||<||#||  fli(<r)  =||  r  || . 

Proof:  Consider  the  optimal  control  problem  of  minimising  the  cost 

/*oo 

Ju(to,z)=  I  II  Zt  ||2  +  II  Ut  ||2  dt 

Jto 

over  controls  u  for  the  system  (6.12).  Define  the  controllability  gra.mmian 
C(to,to  +  O’)  —  /  t)NN  t)dt. 

''to 
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Note  that  ||  ||2<  e2ilAH,  and 

r0ae~2^I  <  C(t0,t0  +  cr)  <||  N  ||2  ae2^I 

so  that  the  system  (6.12)  is  uniformly  completely  controllable  [35]. 
The  value  function  is  given  by 

V(t0,z)  =  ~z'Ztoz, 

where  Zt  solves  the  Riccati  equation 

Z>t  —  AtZt  +  ZtA't  +  ZtNN  Zt  —  I . 

The  optimal  control  is 

u°t  =  -N'Ztzt. 

Now 

rto+o- 

V(to,z)  >  /  \\zt\\2dt 

Jto 

>  p0(a)  |  z  |2  . 

Also,  using  the  control 

u\  =  N'Q-A{to-> to  +  cr)  lz, 


we  obtain 

V(t0,z)<f31(a)\z\2  . 

Finally,  note  that  V(t,z)  is  a  Lyapunov  function  for 

z°  =  —  {At  +  NTt)  z°] 


whence  (6.13). 


I 
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6.2.2  Bounds 


Theorem  6.2  Assume  that  £  i— >  A(f),H(f)  are  continuous  and  bounded,  (6.1) 
is  uniformly  detectable,  and  rankN  =  n.  Then  we  have 

|2  +  ||A||n 


||Qx||  <  [\\Qo\\  + 
IXtII  <  (|l|ft||  + 


2a0 

m2+ 


2/30 


q  <  oo, 

=  p  <  oo. 


(6.14) 

(6.15) 


These  bounds  are  independent  of  T  >  0. 


Note  The  bound  q  depends  on  the  choice  of  A,  while  p  depends  on  a.  To 
obtain  the  best  bound,  one  can  optimise  over  these  parameters.  For  linear  time- 
invariant  detectable  systems,  one  can  also  obtain  a  bound  for  ||  Qt  ||.  Ill 


Proof:  We  modify  an  argument  due  to  A.  Bensoussan  in  [2].  To  prove  (6.14), 
consider  the  following  linear  optimal  control  problem  with  time-varying  coeffi¬ 
cients: 

-i it  =  A!tr}t  +  H[vti  Vt  =  h,  (6.16) 

where  h  G  IRn  is  given  and  v  is  the  control.  The  cost  functional  is 

Ji(v,T)  =  PoQoVo  +  J\v[vt  +  rj'tNN'r,t)dt.  (6.17) 

Define  a  value  function 

V1(h,T)  =  mf{J1(v,T):riT  =  h} 

The  Hamilton- Jacobi-Bellman  (HJB)  equation  is 

+  max  [A,  1A  (-A<  -  H'tv)  -  v2  -  q'NN'}  =  0 
oT  v 

Let  Qt  be  the  solution  of  (6.10).  Then 

V(r),t )  =  q'QtV 
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is  the  unique  (viscosity)  solution  of  (6.11)  with  V(r],  0)  =  rj'Q or). 
Consider  the  (suboptimal)  feedback  control  law 

v(t)  =  A  'tt]t. 


Then  by  (6.16), 


-r)t  =  +  H[ A't)rjt,  t]t  =  h. 


Then  we  have 

V(r],T )  =  h'Qxh  <  rjoQorio  +  f  r)'t(NN>  +  AtA't)r)tdt 

Jo 

Now  using  (6.18), 

M2-2  [TV't(A,t  +  H'tA't)rhdt=\h\2 

Jo 

Hence  using  uniform  detectability  (3.7),  |r?0|2  <  \h\2  and 

fT  \h\2 

l  l’"1  d*  ~  2ao 

Combining  this  with  (6.19)  we  obtain 

h'Qrh  <  h'  (||Q„||  +  l|JV|1^ol|A'-)  * 

which  proves  (6.14). 

Similarly,  to  prove  (6.15),  consider  the  optimal  control  problem 

At  =  At  \t  T  Nvt,  Xt  —  h 


(6.18) 


(6.19) 


(6.20) 


with  cost 

J2(v,  T)  =  \'0P0X+  I  (v't  vt  +  \'t  H[  Ht  A t)dt. 

J  0 

Define  the  value  function 

V2(h,T)  =  inf{J2(u,T)  :  \T  =  h}. 
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Then  the  HJB  equation 


O 

—V2  +  ma x[Dx  V2(At A  +  Nv )  -  v2  -  \'  Il[  Ht  A]  =  0, 

with  V2(A,  0)  =  A'  P0  A  and  solution  V^A,  t)  =  A'  Pt  A,  where  Pt  is  the  solution  of 
(6.11).  By  Lemma  6.1,  set 

v(t)  =  r(  A t, 


then  by  (6.20) 


Ai  =  [At  +  iVTf);  A  T  =  h. 


(6.21) 


Thus 


h'PTh  <  A'  P0  Ao  +  f  a;  (T't  T<  +  H't  Ht) At  dt 
Jo 


Now 


At  -  $i(-t,-P)/t, 


and  hence  |  A0  |2<  ^  |  /i  |2  and 


This  together  with  (6.22)  yields  (6.15). 


(6.22) 


I 


Corollary  6.2  Assume  that  £  (— >  A(£),  H(£)  are  bounded  and  continuous,  and 
that  (6.1)  is  uniformly  of  full  rank.  Then 

II  Qt  ll<(-  II  Qo  II+I|JV||2  +  A  l|2)  =  g  <  co,  (6.23) 

\«i  2a0  ) 

for  all  T  >  0,  where  for  any  r  >  0, 


ao{r)  =  Te"2r (l  +  r2e2T|14 »  ||  N  ||2 

II  N  ||2 


-l 


ai(r)  =  Te4T”AH  1  + 


r0r 
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Proof:  Consider  the  control  system 


zt  —  A'tzt  +  H'tut,  z(0)  =  z,  (6.24) 

where  At  =  A(&)  for  some  curve  t  >— >  £*.  Define  the  grammian 

fto+r 

O(to,to  A- t)  =  /  $Ai(to,t)H'tHi$'Ai(to,t)dt. 

Jto 

Now  ||  ||2<  e2lly4H,  and 

SqTE~  2mI  <  O(t0,to  +  r)  <||  H  ||2  re2||yt|l7 

so  that  the  system  (6.24)  is  uniformly  completely  controllable  [35].  Now  proceed 
as  in  the  proof  of  Lemma  6.1.  | 


6.3  Asymptotic  Convergence 

Using  the  bounds  (6.14),  (6.15)  we  prove  Theorem  6.1,  and  Corollary  6.1.  The 
proofs  are  modifications  of  a  proof  due  to  A.  Bensoussan  in  [2]. 

Proof  of  Theorem  6.1  The  error  e(t)  —  x(t )  —  m(t)  satisfies 

e(t)  =  f(z(t),u(t))  -  f(m(t),u(t))  -  Q(t)H(mt)'RT1(y{t)  -  h(m(t))) 

=  [ A(m(t),u(t ))  -  Q(t)H(m(t))'  H(m(t))]e(t) 

+  [f(x(t),  u(t))  -  u(t ))  -  Df(m(t),  u(i))e(<)] 

—  Q(t)7J(m(t)),/?_1[/i(x(t))  —  h(m(t ))  —  Dh(m(t))e(t)] 


Therefore  using  the  Riccati  equation  (6.11)  for  P(t), 


P(t )  e(t)  =  -e(<)'  P(t)NN'P(t)e{t)  -  e(t)' H (rn(t))' H (m(t))e(t) 

+2 e(t)'P(t)  [  [  rD2  f(m(t)  -f  rse{t),u(t))e{t)2drds 
Jo  Jo 

—<2e(t)'H{m{i))'R~1  I  [  rD2h(m(t)  +  rse(t))e(t)2drds 
Jo  Jo 

||P(t)»e(t)P  <  e(!)'(-ro/«2+|i>(0M*)l[V?ll^/ll  (6-25) 

+V?[|i?-1||2||fl/.||  |[Z>2/>||])  e(t) 

Let  C  =  {y/p\\D2 f\\  +  •\/q((^_1([2  \\Dh\\  \\D2h\\).  %  hypothesis  (6.5)  we  have 

—  ~y  +  \Po  eo\C  <  0. 

r 

Since  P(t)?e(t)  is  continuous,  there  is  an  interval  [0,  t0)  such  that 

-p  +  \P(t)h(t)\C  <  0  on  [0,<o)- 
But  then  (6.25)  implies 

~\P(t)h(t)\2  <  0  on  [0,fo), 

and  thus 

\P(t)ie(t)\  <  |fT„| 

for  t  £  [0,to)-  By  continuity  this  inequality  holds  for  t  6  [0,fo]-  Hence  we  can 
proceed  from  t0  on.  Thus  there  exists  6  >  0  such  that 

ip«k*)i  <  \  (jjl  -  *) 

for  all  t>  0.  So  (6.25)  implies 

jt\P{t)h(t)\*  < -S\e(t)\\ 
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But  from  (6.15) 


e(t)'P(t)e(t)<\\P(t)\\  \e(t)\2  <  p\et\\ 

so  that 

Jt e(t)'P(t)e(t )  <  -8-e(t)'P(t)e{t), 

which  implies 

e(t)'P{t)e(t)  <  e(0)'Poe(0)e-pJ,  t  >  0. 

Therefore,  using  (6.14),  we  have 

|e(*)|2  <  qe(t)'P(t)e(t) 

<  qe(0yPoe(0)e-p\  t  >  0, 

which  implies  (6.6).  I 

Proof  of  Corollary  6.1  We  have 

je(t)'P{t)e(t)  =  - e(tyP(t)NN'P{t)e(t )  -  e(t)'. II(m{t))' H{m(t))e(t) 

+2 e(tyP(t)(f(x(t),u(t))  -  u{t ))  -  Df(m(t),u(t)))e(t)) 

- 2e(ty(R-1Dh(m(t)),R-1(h{x(t ))  -  -  Dh(m(t)e(t )) 

<  (-^  +  4p||D/||  +  4||iT1||2  ||.D>*||2)  |e(i)|2 

By  assumption  (6.8)  there  is  a  6  >  0  such  that 

~  +  4p||J3/||  +  4UJT1 1|2  \\Dhf  =  -S<  0. 
r 

Therefore 

4|F(()5e(()|2  <  -<W<)|2 

for  all  t  >  0  and  all  e0  €  Mn.  This  implies  (6.9).  I 
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6.4  Examples 


We  now  give  some  simple  examples  to  illustrate  the  applicability  of  our  design. 

6.4.1  Bilinear  Dynamics,  Linear  Observation 

Consider  the  general  bilinear  system 

m  \ 

A  +  ^2  uiBi )  xi  x(°)  =  xo,  (6.26) 

i= 1  / 

y  =  Cx. 

We  assume  |«t-|  <  1,  p  =  1,  and  here  (  =  «£  [—1,  l]m  =  X.  Write 

m 

A(u)  =  A  +  Y^UiBi. 
i= 1 

Define,  for  r  >  0,  the  observability  grammian 

fto+T 

O(to,to  +  T)=  /  t)dt, 

J  to 

where  At  =  A(u(t)).  Assume  that  (6.26)  is  uniformly  observable  in  the  sense 
that  there  exists  r  >  0  such  that  for  all  to  >  0 

7o(T )1  <  O(t0,  t0  +  r)<  7i(r)/ 

for  constants  7o(r)>  7i(T)  >  0;  independent  of  the  control.  The  we  can  bound 
||  Qt  ||  as  in  Corollary  6.2. 

Then  the  following  system  is  an  observer  for  (6.26),  with  no  contraints  on 
the  initial  conditions. 

m(t )  =  A{u{t))m{t)  +  Q(t)Cf(y(t)  -  Cm(t)),  m(0)  =  m0,  (6.27) 

Q(t)  =  A{u{t))Q{t)  +  Q{t)A{m{t))'-Q{t)C'CQ{t)  +  I ,  Q0  =  I. 
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For  simplicity  we  have  taken  Q0,N,R  to  be  identity  matrices.  To  improve  the 
decay  rate  7,  one  could  try  other  values  for  Q0,  N,  R. 

Compare  this  design  with  the  design  for  linear  time-varying  systems  in 
Willems  and  Mitter  [57],  and  0’  Reilly  [49]. 

Now  in  JR?  consider 


A(u)  = 

c  = 


an(u) 

v  a2i(u) 

(1,0), 


a12(u) 

a22(u)  y 


and  assume  that 


«22(^)  <  — Q!o,  ao  >  0, 


for  all  |  u  |<  1.  Then  setting 

A(«)  = 

gives 


(  (  ,  \ 

-cto  -  an(u) 

^  ~au(u)  ~  «2i(w)  j 


T]' (A(u)  +  A(u)C)rj  =  -a0  |  r]  |2  - 

Then  (C,  A(u))  is  uniformly  detectable  and  (6.27)  is  an  observer  for  (6.26). 


6.4.2  Stable  Linear  Dynamics,  Nonlinear  Observation 


Consider  the  system 


/  ■  \ 

X\ 

I*2 

y  = 


l 


\ 


1  72 
72  -4 


/  \ 
27 

+ 

re 

l*2  J 

re 

sin  07. 


(6.28) 


This  system  is  controllable  and  observable.  However,  the  pair  of  matrices 
( Dh(x ),  A)  is  not  observable  for  07  =  where  k  is  an  odd  integer.  The  system 
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has  eigenvalues  -1,-2  and  A  is  symmetric,  hence  (Dh(x),A)  is  automatically 
uniformly  detectable,  with  a0  =  1,  A(s)  =  0.  Let  R  —  rl,  N  =  y/r^I,  Q0  =  7 2I. 
Here,  H(x)  =  ^(cos £1, 0).  Now 

8  —  r0( 72  +  r0/2)-2  -  4r-2. 

Set  r  =  3,  r0  =  0.2,  7  =  0.1.  Then  8  =  7.82. 

The  observer  for  (6.28)  is 

m(t)  =  Am{t)  +  Bu{t)  +  ~Q(t)H(m(t))'  (y(t)  —  sinrai(t)),  (6.29) 

O 

Q(i)  =  Ag(i)  +  g(0A,-g(t)L/(m(t)),77(m(t))g(i)  +  0.27. 

By  Corollary  6.1,  m(t)  converges  exponentially  to  x(t)  for  all  x0,mo  G  JR” . 


6.4.3  A  Special  Case 

We  now  give  an  example  of  a  class  of  systems  for  which  observer  design  is  easy. 
Consider 


x(t)  =  f(x(t),u(t)),  x(0)  =  x0,  (6.30) 

y(t)  =  Cx(t), 

assuming  that  C  is  invertable: 

C'C  >  s0I  >  0. 


Then  the  system 

m(t)  =  — C'  {y(t)  —  Cm(t)) ,  m(0)  =  m0  (6.31) 

is  an  observer  for  the  control  system  (6.30)  for  all  initial  values  provided  that 
given  7  >  0,  a  is  chosen  so  that 

"-7+  ||  All- 
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To  see  this,  we  have 

f  I  e(<)  !2  =  2e(f)  (f(x(t),  u(())  -  f(m(t),u(t))  -  i-C'Cef*)) 

<  2  (11  a  H  ~)  1  eW  I2 

<  -2-r  I  =(0  I2  - 

Thus 

|  x(t)  —  m{t)  |  <  |  a?o  —  m0  |  e_7< 

for  all  Xo,m0  £  Mn. 

Of  course  one  can  compute  x(t)  directly  by  inverting  C.  The  above  dynamical 
procedure  seems  more  robust.  In  fact,  one  could  use  it  to  invert  matrices! 

The  rate  of  convergence  can  be  made  arbitrarily  fast  by  adjusting  7  >  0. 

6.5  An  Exact  Observer 

In  this  section  we  shift  our  perspective  somewhat  and  obtain  an  infinite  di¬ 
mensional  observer  for  uncontrolled  systems.  The  observer  recovers  exactly  the 
unknown  state  under  a  natural  observability  condition. 

Consider  the  dynamical  system 

x  =  f(x),  x(0)  =  x0 ,  (6.32) 

V  =  Hx), 

and  assume  /  £  C\ (IRn ,  Mn) ,  h  £  Cb(lRn,lRp)-  Let  x(t)  =  7(^*0  denote  the 
solution  to  (6.32). 

Definition  6.1  The  system  (6.32)  is  observable  in  finite  time  if  there  exists 
0  <  T  <  00  such  that  for  any  Xq  Xq  there  is  a  t  £  [0,  T]  such  that 

MtWso)  ^  M7(%o)- 
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Compare  with  the  definition  of  observability  given  in  Chapter  3,  and  also  the 
definition  for  time-varying  linear  systems.  Our  design  also  applies  to  controlled 
systems  provided  that  Definition  6.1  holds  for  each  control  function. 

We  modify  the  deterministic  estimator  (4.7)  by  setting  S0  =  0,  and  obtain 
the  following  system: 

§-tm(x,t)  +  f(x)  ■  Dm(x,t)  +  ^  |  Dm(x,t)  |2  -^h(x)2  +  y(t)h(x)  =  0, 

m(x,  0)  =  0,  (6.33) 

*(0  =  m(x,t). 

This  system  is  of  the  form  (3.2)  with  state  space  M  =  C(IRn,lR ),  and  is  con¬ 
trolled  by  the  observation  path  y(-)  of  (6.32). 

Theorem  6.3  If  the  system  (6.32)  is  observable  in  finite  time ,  then 

x(t )  =  x{t ) 

for  all  t  >  T  and  all  x0  £  lRn .  Therefore  (6.33)  is  an  observer  for  (6.32). 

Proof:  We  have 

m(x,t)  =  hrf  |  ^  |  Os  -  f{0s)  |2  +  |  ys  -  h{9s )  |2  -  |  ys  |2  ds  :  0t  =  x | . 

Then,  for  t  >  T,  m(-,t)  has  a  unique  minimum  x(t)  —  x(t)  if  and  only  if  (6.32) 
is  observable  on  [0,  T],  Note  that  (3.3)  is  satisfied  a  fortiori.  | 

Remark  This  type  of  observer  can  be  defined  for  systems  evolving  on  manifolds 
with  manifold-valued  observations.  This  and  other  developments  will  be  treated 
elsewhere.  /// 
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Chapter  7 

Miscellaneous  Topics 

We  collect  in  this  final  chapter  some  miscellaneous  comments  and  results. 
A  large  deviation  principle  for  the  distributions  of  the  conditional  measures  is 
presented  in  Section  7.1.  Thus  there  is  a  second  level  of  large  deviation  behaviour 
associated  with  our  asymptotic  filtering  problem.  The  Benes  class  of  systems,  for 
which  finite  dimensional  filters  exist,  is  treated  in  Section  7.2.  Then  in  Section 
7.3  we  obtain  the  different  limits  that  result  when  the  signal  to  noise  ratio  is 
altered.  Finally  in  Section  7.4  we  make  some  concluding  remarks. 

7.1  Distributions  of  Conditional  Measures 

Let  be  the  conditional  probability  measures  defined  by  (2.9),  (2.10),  viewed 
as  random  elements  of  T(IRn)  (2.11).  The  distribution  of  is  a  probability 
measure  P*t  on  V{JRn)  defined  by 

PiXA)  =  p  (K  z  A)  (XeS(V))).  (7.1) 

In  this  section  we  establish  a  LDP  for  {P£t}- 
Recall  the  maps 

7 r|,  7ft  :  %  -*•  T(IRn), 

where  % is  the  pathwise  filter  (2.20),  (2.21)  and  7rt  is  the  deterministic  estima¬ 
tor  (5.13);  assumed  well  defined.  Recall  that  {Py}  obey  the  LDP  with  action 
function  Iy  defined  by  (2.40).  For  v  £  V{lRn ),  define 

LM  =  inf  {lr(v)  '■  V  €  7f7V)}  .  (7.2) 
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Theorem  7.1  Under  the  above  assumptions,  {P^t}  obey  the  LDP  with  action 
function  IVt(v). 


Proof:  Noting  that 


7Tt 


P£ 

rY 


and  since  71^,  7 r<  are  continuous  and  limf_o  k\  =  7 r<  exists  uniformly  on  compact 
subsets  of  fio,  we  can  apply  the  contraction  principle  to  deduce  the  assertion.  | 


Notice  that 


inf  jV(i(i),  ()[>;]  +  jjf  rilds  :tl€S 1"' 


(7.3) 


7.2  The  Benes  Class 

In  [3],  Benes  introduced  a  class  of  systems,  with  nonlinear  drift  and  linear  obser¬ 
vations,  for  which  the  conditional  density  can  be  explicitly  computed  by  a  finite 
set  of  ordinary  and  stochastic  differential  equations.  That  is,  the  stochastic  filter 
is  finite  dimensional.  Hijab  [25]  has  studied  this  class,  as  well  as  the  analogue 
for  deterministic  estimation.  In  this  section  we  apply  the  asymptotic  filtering 
method  to  the  Benes  class  of  systems.  Under  certain  (restrictive)  conditions,  an 
observer  results  in  the  limit. 

We  are  interested  in  systems  of  the  form 

x  =  /(x),  x(0)  =  so,  (7.4) 

i  =  Cx,  z(0)  =  0, 

where  x(t)  e  JRn  and  z(t)  €  IBP.  Consider  the  filtering  problem 

dxe(t)  =  f(xc(t))dt  +  y/eNdw(t),  xe(0)  =  xc0,  (7.5) 
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dxf(i)  =  Cxt(t)dt  +  y/eRdv(t ),  ye(0)  =  0, 


where  NN'  >  r0I,  R  >  0. 

The  Benes  class  of  drifts  /  £  C°°(lRn,lRn)  are  those  satisfying: 

(i)  for  some  F  £  Cca(IRn,IR), 

f(x)  =  —NN'  DF(x)'\ 

(ii)  for  all  e  >  0, 

f(x)'(NN')~1f(x)  +  ediv/(a;)  —  x'^x  +  2'Ex  +  2c/T, 

where  ££  — ►  £,  S£  — >  S,  (p£  — >  as  e  — >  0,  and  ££,£  are 

symmetric. 

Two  additional  detectability  conditions  are  required: 

(iii)  D2F(x)  >  0; 

(iv)  for  all  e  >  0, 

C'{RR!)-^C  +  E£  >  a0I,  ao>0. 

The  initial  condition  x0  is  unknown,  so  we  let  xc0  be  random  with  density 
Po(x)  =  Ceexp(-\S0(x)),  where 

50(aj)  =  ^{x  -  no )'Po(x  -  Ho)  +  F(x), 

for  some  Ho  €  Mn,P0  >  0.  As  before,  P0,N,R  are  design  parameters,  although 
N  is  restricted  according  to  (i)  and  (ii)  above. 

Let  qe(x,t)  denote  the  unnormalised  conditional  density  solving  the  Zakai 
equation.  We  use  the  exponential  representation: 

(7-6) 
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Using  Ito’s  rule,  We(x,t)  solves 


dWt  -  ~tr(NN'D2We)dt  +  i  |  DWeN  |2  dt  +  DWe  ■  f(x)dt 

Li  Lj 

—  —  |  R~lCx  |2  dt  +  x'C' (RR')~X dy^^^t)  —  ed\vf(x)dt  =  0,  (7-7) 

Li 

Wc(x,  0)  =  S0(x)  -  e\ogCc. 

We  now  find  an  explicit  solution  for  qe(x,t)  via  (7.6),  (7.7)  using  a  gauge  trans¬ 
formation. 

Define 

Ue(x,t)  =  We(x,t)~  F(x). 

Then  we  have 

dUc  -  ^tv(NN'D2Ue)dt  +  \  |  DUeN  |2  dt  -  3exdt  +  x'C\RR!)-xdy\t) 

Lj  Li 

-\x\C\RR!)-xC  +  Xe)xdt-^dt  =  0, 

Lj 

uc(x,  0)  =  ^(®  -  fio)'Po{x  -  Ho)  -  elogCe. 

If  we  look  for  solutions  of  the  form 

Ue(x,t)  =  ^(x-pt(t)),Pc(t)(x-nc(t))  +  il>e(t), 

we  obtain,  after  some  manipulation, 


dyc(t)  = 

-P'{t)-1(Zt  +  2cne(t))dt 

(7.8) 

PPt{t)-xC\RR!)-1{dy\t)  -  C^(t)dt), 

P£(t)  = 

—Pe(t)NN,Pe(t)  +  C\RR!)~lC  +  Se, 

(7.9) 

II 

V 

+  \doPodo  -  clog  Ct 

(7.10) 

+  f  ( |tr(JVJV'P‘(s))  -  ^,’(s)'P‘(s)NN'P‘{s)„‘(s)  +  v‘)  da, 
with  initial  conditions 

^(0)  =PL  o,  PE(0)  =  Po.  (7.11) 
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Then  one  can  prove: 


Under  assumptions  (i)-(iv),  the  unnormalised  conditional  density  qc(x,t )  has  the 
explicit  solution  (7.6),  where 

Wc(x,t)  =  i(ar-  nc(t))'Pc(t)(x-  p€(t))  +  +  F(x), 

and  Pe(t),  pe(t),  ^(t)  are  the  solutions  of  (7.8)-(7.11). 

A  filter  for  the  maximum  likelihood  estimate 

mV)  =  W‘(x,t) 

can  be  obtained  by  differentiating  the  equality:  DWc(me(t),  t)  =  0  (c.f.  Chapter 
4).  In  general  me(f)  does  not  equal  the  conditional  mean  xe(t). 

We  now  turn  to  the  limiting  filter:  the  deterministic  estimator  for  the  obser¬ 
vation  record  Z  =  { z(t )  :  0  <  t  <  T}.  Here,  Z  is  the  limit  in  probability  of  Ye, 
given  by  (7.4),  and  is  viewed  as  an  observation  record  of  the  auxilliary  system 

*  =  /(*)  +  Nu,  5(0)  =  x0,  (7.12) 

V  =  Cx  +  Rv,  ?/(0)  =  0, 

with  cost 

Jt(x0,u)  =  ~(x0  ~  fJ.0)'Po(xo  -  po)  (7.13) 

+  1  \  I  «(«)  !2  I  BrxCx(s)  I2  ~x{s)'C,{RR!)-ly(s)ds. 

Conditions  (ii)  and  (iv)  reduce  to 
(“)’  f{x)'(NN')-'f(x)  =  x'Zx  +  2~x  +  2  ip; 

(iv)’  C'(RR!ylC  +  S  >  aol. 
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The  value  function  W(x,t)  satisfies 


ftW  +  l  |  DWN  |2  +DW  •  f{x)  -  l  |  RT'Cx  |2  +x'C,{RR')-1  z{t)  =  0, 

z  z 

W(x,0)  =  So(x),  (7.14) 

and  is  given  explicitly  by 

W(x,t)  =  ^(x- p(t))'P(t)(x- p(t))  +  if(t)  +  F(x),  (7.15) 

where  P(t),fi(t)  and  ip(t)  are  the  solutions  of 

Kt)  =  +  +  P^C'iRR')-1^)  -  Cl^(t)),  (7.16) 

P(t)  =  -P(t)NN'P(t)  +  C'iRR’y'C  +  S,  (7.17) 

'/-■(<)  =  ■  ^.luyntjiifi)  -  '-ij'j’aiu, 

+  J‘  (- i/. {,)'P(s)NN'P(3)ii(,)  +  <?)  *,  (7. 18) 

with  initial  conditions 

/r(0)  =^0,  P(0)  =  P0.  (7.19) 

Then  Wc(x,t)  — >  W(a:,i)  in  probability  as  e  — ►  0. 

Recall  the  deterministic  estimate 

m(t)  =  Jr| W{x,  t ) 

(Previously  we  used  the  notation  x(t).) 

Theorem  7.2  Under  assumptions  (i),  (ii )  (in)  and  ( iv )  the  deterministic 
estimator  for  the  system  (7.4)  is  well  defined  and  can  be  computed  by  the  finite 
dimensional  filter 

m(t)  =  f(m(t))  +  Pity'CfRRT'iKt)  -  Cm(t)),  m(0)  =  m0,  (7.20) 

P{t)  =  P(t)  +  D2F{m{t)),  (7.21) 
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where  P(t )  is  the  solution  of  ( 7.17).  Further ,  the  value  function  can  be  explicitly 
computed  as  above. 


Proof:  Let  Q(t)  =  P(t)  *,  which  solves 

Q(t)  =  -Q(t)  (C\RR!)-XC  +  S)  Q(t)  +  NN'. 
Using  the  methods  in  Section  6.2,  we  have 


r0 


II  Q(t)  ||  <  II  Qo  ||  + 


N  II2  +  II  C'jRR^-'C  +  Z 
2a0 


for  all  t  >  0.  To  obtain  the  bound  for  P(t),  one  uses  the  control  v(t)  —  N\t  in 
(6.20)  with  At  =  0.  The  bound  for  Q(t)  follows  from  the  detectability  condition 
(iv)’:  setting  A  =  — ( C\RB!)~1C  +  E)U  we  have 

rf  (A {C\RK)~xC  +  S))  r)  <  -a0  U|2. 

Now 


D2W(x,t ) 


=  P(t)  +  D2F{x) 

>  -I  >  0. 

< l 


Hence  W(-,t)  is  strictly  convex,  and  therefore  has  a  unique  minimum. 


Finally,  we  present  a  limited  result  on  whether  or  not  the  deterministic  esti¬ 
mator  is  an  observer  for  (7.4).  Assume  further 
(hi)’  D2F(x)  >  for  some  f30  >  0. 

Define 

S(P0,N,R)  =  ^  —  2p  ||  Df  ||  —  ||  E  || 

—2 q  ||  C  ||2||  JT1  f  (</  +  II  (D2F)-'  II)  . 
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Theorem  7.3  Under  assumptions  (i),  (ii)’-(iv)’,  if  there  exists  Pq ,  N,  R  such 
that 

6(P0,N,R)  >  0,  (7.22) 

then  the  system  (7.20),  (7.21),  (7.17)  is  an  observer  for  the  dynamical  system 
(7.4).  That  is,  there  exists  constants  K  >  0,7  >  0  such  that 

|  x(t)  —  m(t)  |  <  I(  |  x0  —  m0  |  e~7< 

for  all  t  >  0  and  all  xo,mo  e  lRn . 

Proof:  We  have 

fc(i)'P(t)e(t)  =  2e(t)‘P(t)  (/(*(())  -  /(m(l))  -  P(t)-'C'(RP')-1Ce(t)) 
+e(ty  (-P(t)NN'P(t)  +  C'iRR'y'C  +  S)  e(t). 

Now  using  Kailath  [32],  p656,  we  have 

P(t)~'  =  [/>(()  +  DVfmtf))]-1 

=  <3(0  -  Q(0  (0(0  +  (02F(m(t)))~l)  0(0- 

Using  this  identity  in  the  above  equality  and  estimating  the  various  terms, 
one  proves  the  theorem.  | 

Remark  In  general,  it  is  not  clear  that  condition  (7.22)  is  verified.  Notice 
that  the  bounds  p ,  q  are  bounded  functions  of  ||  R  ||—+  oo,  so  the  last  terms 
in  6(P0,N,R)  can  be  made  small.  From  another  point  of  view,  under  our  as¬ 
sumptions  (7.4)  is  asymptotically  stable,  and  the  observer  (7.20),  (7.21)  can  be 
viewed  as  a  perturbation.  /// 


111 


7.3  Limits  for  Other  Signal  to  Noise  Ratios 


In  the  Introduction  we  introduced  the  family  of  filtering  problems  (1.4)  with 
noise  scaling  \fe  in  the  state  equation  and  \f&  in  the  observation  equation.  The 
quantity  e/6  is  a  measure  of  the  signal  to  noise  ratio  or  the  relative  rate  at  which 
the  noise  goes  to  zero.  We  now  describe  the  different  limiting  behaviour  that 
occurs  as  this  ratio  is  altered. 

Specifically,  consider  the  filtering  problems 

dxt{i )  =  f(x£{t))dt  -fi  e^dwft),  xe(0)  =  Xq,  (7.23) 

dyc[t)  —  h(xc(t))dt  +  el~'1dv(t),  y£( 0)  =  0, 

where  0  <  7  <  1.  The  signal  to  noise  ratio  (SNR)  is  e27-1,  and  note  that 
—  1  <  2q  —  1  <  1.  Bensoussan  [4]  treated  the  case  0  <  7  <  1/2,  Picard  [51]  and 
Ji  [31]  the  case  7  =  0,  and  Hijab  [25],  [26]  the  case  7  =  1/2.  In  [4],  [51]  and  [31] 
their  main  concern  was  the  construction  of  approximate  filters. 

In  Chapter  2  we  defined  the  notion  of  large  deviations  for  the  specific  nor¬ 
malisation  factor  e-1.  More  generally,  as  in  Freidlin  and  Wentzell  [20],  we  say 
that  {P£}  obey  the  LDP  with  action  function  /  and  normalisation  coefficient 
A(e)  provided  (i)  and  (ii)  hold  in  Varadhan’s  definition  (Section  2.2.1)  and 

limA(e)-1  log  P e{A)  x  e-A(e)infx^  6X) 

in  the  sense  of  (iii)  and  (iv)  in  that  definition.  Here,  A(e)  — >  00  as  e  — >  0. 
Similarly  we  can  modify  the  definition  for  random  measures  {Q£}. 

We  take  to  have  density  Pq(x)  =  Cc  exp(— ^-So^)),  with  So  defined  as  in 
Section  2.1.  The  trajectories  of  (7.23)  converge  in  probability  to  the  trajecto¬ 
ries  of  the  corresponding  deterministic  system  (2.35),  (5.1).  It  is  clear  that  the 
distributions  {Py}  °bey  the  LDP  with  action  function  Ix(0)  defined  by  (2.38) 
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and  normalisation  coefficient  A(e)  —  e-2'1'.  It  turns  out  that  three  types  of  lim¬ 
iting  behaviour  occur  for  the  filtering  problem,  depending  on  the  value  of  the 
parameter  7: 


(a) 

0  <  7  <  1/2 

(high  SNR), 

0>) 

7  =  1/2 

(balanced  SNR) 

(c) 

1/2  <  7  <  1 

(low  SNR). 

The  Kallianpur-Streibel  formula  for  the  conditional  measures  takes  the  form 

£e^(A) 


IP-'V) 

S£’7(A) 


L 


exp 


P-  a.s. 
1 


c2—2,y 


\io  h(°^ds  ~  J0  h(PMv\ 


Pcx(dO)  P- a.s., 


for  A  G  B( fln).  For  0  G  Cln  define  F(6)  by  (5.44).  Define 


A7(e) 


r(o) 


Also,  we  define 


^  « 0  <7  <1/2, 

.37  »  1/2  <  7  <  1; 

F(0)  if  0  <  7  <  1/2, 

-  m+F{6)  if  7  =  1/2, 

k  1(0)  if  1/2  <  7  <  1. 


r(0)  =  Jh(0)-  inf  J>')- 


If  0  G  is 


J7(d)  - 


absolutely  continuous, 

\  Jo  (h(0s)  -  zs)2ds 

<  5o(0o)  +  |  /0r  |  ^  -  /(0.)  |2  +(M^)  -  is)2Rs 
,  So(0o)  +  \fo  \0s-m)  I2  ds 


if  0  <7  <  1/2, 

if  7  =  1/2, 
if  1/2  <  7  <  1. 


Theorem  7.4  {ne,'Y}  obey  the  LDPP  with  action  function  Jy(0)  and  normali¬ 
sation  coefficient  A'l'(e). 
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The  proof  of  this  theorem  depends  on  the  following  lemma. 


Lemma  7.2  If  C  C  Cln  is  closed  and  G  C  Sln  is  open,  we  have: 


limsup  A7(e)  log  Ee,7(C)  < 
6 — >0 

liminf  A7(e)  log  EC’7(G)  > 


-  inf  J7(0) 
eec  v  ' 


in  probability, 
in  probability. 


Proof:  The  proof  is  similar  to  that  of  Theorem  5.4,  so  we  only  sketch  some  of 
the  changes  required. 

The  basic  idea  is  to  notice  that 

e2-27  log  Se,7((7)  <  —  inf  F(6)  +  e2~47e27  log  P^-(C')  +  8, 

6&C 

and  if  0  <  7  <  1/2  then  the  second  term  on  the  right  goes  to  zero.  Similarly, 
e27 log  X£,7(C)  <  — e47-2  inf  F(0)  +  e27  log  Px(C)  +  6, 

9£C 

and  ifl/2<7<l,  then  the  first  term  goes  to  zero. 

These  ideas  can  be  made  precise  along  the  lines  of  Theoerm  5.4  and  Lemma 
5.5.  I 

If  7  =  0,  then  the  state  equation  is  independent  of  e.  Then  the  resulting 
action  function  is  random,  given  by,  for  9  E  fP  absolutely  continuous, 

j°mn  =  \  [  (ko.)  -  h(x\(u)jf  is, 

with  normalisation  coefficient  A°(e)  =  e-2.  This  result  was  obtained  by  Ji  [31]. 
At  the  other  extreme,  if  7  =  1,  then  yc(t)  — >  y°(t),  where 

y°(t)(u:)  —  J  h(x(s))ds  +  v(t)(u). 
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Here,  x(t)  is  the  deterministic  trajectory.  In  this  case,  Theorem  7.4  remains 
valid. 

In  the  cases  1/2  <  7  <  1  we  have: 

n£>7  4  8X , 

as  e  — »  0,  where  left"  denotes  the  deterministic  trajectory.  If  0  <  7  <  1/2 
and  h  is  injective,  then 

IT’7  4 

as  e  — »  0.  However,  when  7  =  0  and  h  is  injective  the  limit  is  random: 

ne’°  4  8Xi 

as  e  — >  0.  This  injectivity  condition  was  used  in  Picard  [51]  and  Ji  [31]. 


7.4  Concluding  Remarks 

We  have  seen  that  the  asymptotic  filtering  approach  leads  to  the  suggestion  that 
the  limiting  filter,  identified  as  the  deterministic  estimator,  is  a  candidate  ob¬ 
server  (Baras  and  Krishnaprasad  [1]).  Indeed,  this  approach  has  motivated  two 
observer  designs:  a  finite  dimensional  observer  for  uniformly  detectable  systems, 
and  an  infinite  dimensional  observer  for  observable  systems.  The  former  esti¬ 
mates  the  state  asymptotically  as  t  — >  00,  while  the  latter  computes  the  state 
exactly  after  a  finite  time  has  elapsed. 

A  gap  in  the  theory  is  the  lack  of  simpler  conditions  implying  detectability 
and  observability.  The  principal  disadvantages  of  our  finite  dimensional  design 
involve  the  computation  of  the  matrix-valued  function  A(x,u)  and  the  sensitive 
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dependence  of  the  radius  of  convergence  and  decay  rate  on  the  design  parame¬ 
ters.  The  infinite  dimensional  observer  presents  some  interesting  computational 
problems,  akin  to  those  arising  in  nonlinear  filtering. 

There  are  a  number  of  interesting  questions  regarding  the  interactions  be¬ 
tween  the  limits  as  e  — >  0  and  t  — >  oo  that  might  have  a  bearing  on  our  problem. 
Kunita  [36]  has  studied  the  large  time  behaviour  of  nonlinear  filtering  errors  in 
the  case  that  the  signal  state  space  X  is  compact.  Ji  [31]  has  extended  some 
of  Kunita’s  results  to  noncompact  state  spaces.  The  conditional  measures  {-ay} 
are  viewed  as  a  Markov  process  with  values  in  V(X),  and  the  filtering  error  is 
computed  in  terms  of  invariant  measures  of  the  processes  {aq},  Large 

deviation  results  for  invariant  measures  of  diffusions  in  IRn  have  been  obtained 
by  Freidlin  and  Wentzell  [20]  (for  example  Theorem  4.3,  page  129).  In  view  of 
the  LDP  for  the  distributions  of  {7r|}  obtained  in  Section  7.1,  one  might  find 
some  large  deviation  result  for  invariant  measures  of  the  filtering  process  {7r|} . 
Detectability  and  observability  may  have  implications  in  filtering.  Detectability 
is  connected  with  the  positive  definiteness  of  the  Hessian  of  the  value  function. 
In  the  context  of  extended  Kalman  filtering,  Mitter  [45]  alluded  to  this.  These 
remain  unresolved  issues  in  our  investigations. 

It  is  our  opinion  that  “approximate”  estimators,  an  example  of  which  was 
presented  in  Chapter  6,  are  more  likely  to  be  useful  in  practice  than  “exact” 
estimators.  Further,  in  practical  situations  the  structure  and  symmetry  of  the 
problem  at  hand  should  be  fully  exploited.  Like  the  Kalman  filter,  both  of  our 
designs  seem  to  enjoy  robustness  properties.  An  approximation  to  the  infinite 
dimensional  observer  may  be  useful  in  providing  an  initial  estimate  for  the  finite 
dimensional  asymptotic  observer. 

It  seems  difficult  to  build  a  general  and  easily  usable  theory  of  observer  design 
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design  for  nonlinear  control  systems.  The  asymptotic  filtering  approach  provides 
insight  and  suggests  potentially  useful  observer  designs. 
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